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Abstract 


-l- 

A  unique  algorithm  was  developed  to  recognize  isolated 
words  given  the  output  of  an  extremely  variable  feature 
extraction  process.  Because  of  the  high  error  rate  of  the 
acoustic  processor,  it  was  necessary  to  rely  on  the  consis¬ 
tency  of  the  sequences  of  phonemes,  and  the  errors  that 
typically  occur  for  a  given  word  to  determine  the  word  spo¬ 
ken.  This  was  accomplished  by  generating  error  statistics 
and  phoneme  representations  for  each  word  in  the  vocabulary 
using  a  set  of  training  speech  files.  The  top  five  phoneme 
choices  provided  by  the  acoustic  processor,  and  information 
indicating  the  accuracy  of  each  choice,  for  each  time  seg¬ 
ment  of  speech  was  implemented.  Fuzzy  set  theory  was  used 
to  combine  this  information  with  the  error  information 
obtained  from  the  training  files  to  determine  the  word  spo¬ 
ken.  ^Because  of  the  number  and  types  of  errors  present  in 
the  acoustic  output,  the  algorithm  produces  a  word  score 
indicating  the  plausibility  of  a  word  being  spoken  regard¬ 
less  of  the  number  of  errors  that  occurred.  To  recognize  a 
speech  file,  a  score  is  generated  for  each  word  in  the  voca¬ 
bulary,  and  the  word  with  highest  score  is  chosen  as  the 
word  that  was  spoken.  The  number  of  operations  required  to 
make  a  decision  increases  linearly  as  the  size  of  the  vocabu¬ 
lary  increases,  and  the  algorithm  can  readily  be  adapted  for 
real  time  speech  recognition.  The  algorithm  can  also  be 
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I .  Introduction 


In  the  past  30  years  there  has  been  a  significant  amount 
of  research  conducted  in  the  area  of  automatic  speech  recog¬ 
nition.  The  reason  so  much  attention  has  been  given  to  this 
area  is  that  the  possible  applications  for  a  speech  recogni¬ 
tion  machine  appear  to  be  unlimited.  The  list  of  possible 
applications  include  the  military,  industry,  office,  school, 
and  home  applications;  and  involve  almost  any  area  of  human 
endeavor  in  which  there  is  a  human-machine  interface.  Al¬ 
though  there  has  been  some  promising  results  in  this  area  of 
research,  automatic  speech  recognition  remains  a  distant  goal. 

Background 

There  are  many  reasons  why  there  are  currently  no  effi¬ 
cient  automatic  speech  recognition  systems,  but  perhaps  the 
major  reason  is  that  there  is  a  lack  of  appropriate  decision 
algorithms.  Speech  recognition,  as  with  any  other  pattern 
recognition  problem,  can  be  separated  into  two  processes:  a 
feature  extraction  process,  and  a  decision  process.  Figure 
1  illustrates  these  processes  for  speech  recognition.  First, 
features  are  extracted  from  the  speech,  and  then  these  fea¬ 
tures  are  input  into  the  decision  process  which  determines 
the  words  that  were  spoken.  Because  of  the  inability  of 
present  decision  algorithms  to  efficiently  determine  which 
words  spoken,  given  a  set  of  features  containing  errors,  many 
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Block  Diagram  of  Speech 
Recognition  Processes 


researchers  have  attempted  to  develop  feature  extraction  pro¬ 
cesses  that  would  perfectly  segment  speech.  However,  even 
if  a  perfect  feature  extraction  process  was  developed,  there 
would  still  remain  a  number  of  errors  in  its  output  since  it 
is  impossible  for  humans  to  perfectly  reproduce  any  known 
features.  This  fact  is  easily  verified  by  the  inability  of 
humans  to  repeatably  pronounce  any  word  exactly  the  same,  and 
by  the  fact  that  there  are  vast  differences  i’n  the  speech  of 
different  speakers.  Therefore,  the  research  effort  should 
not  be  confined  to  the  development  of  a  perfect  feature  ex¬ 
traction  process,  but  include  as  well  the  development  of  an 
efficient  decision  process  that  could  determine  the  words 
spoken  given  that  there  will  always  be  a  number  of  errors 
present  in  the  features  extracted  from  the  highly  variable 
input  speech. 

Problem 

The  objective  of  this  research  was  to  develop  a  decision 
process  which  would  be  capable  of  determining  the  words  spo¬ 
ken  from  a  set  of  reasonably  consistent  speech  features. 
Consistent  features  will  be  defined  as  features  that  appear 
repeatibly,  but  not  necessarily  always,  for  a  given  word. 

The  more  specific  objective  of  this  research  was  to  develop 
an  algorithm  which  would  recognize  the  words  from  the  fea¬ 
tures  extracted  by  the  acoustic  processor  developed  by  Karl 
Seelandt  (Ref  15) . 


The  features  which  Seelandt's  algorithm  extracts  are 
basic  units  of  speech  that  will  be  referred  to  as  phonemes. 
Although  the  term  "phoneme"  classically  implies  a  more  exact 
set  of  speech  segments,  this  term  will  be  used  here  to  refer 
to  any  segment  of  speech  that  is  defined  to  be  unique. 

The  output  of  Seelandt's  acoustic  processor  contains 
many  errors;  including  substitutions,  insertions,  and  dele¬ 
tions.  These  errors  result  from  the  inadequacies  of  his  al¬ 
gorithm,  and  by  the  omission  and  mutilation  of  phonemes  by 
speakers.  There  have  been  many  decision  algorithms  imple¬ 
mented  and  suggested  in  the  literature  which  attempt  to  over¬ 
come  problems  associated  with  errors  in  similar  outputs;  how¬ 
ever,  because  of  the  number  and  types  of  errors  present  in 
Seelandt's  output,  none  of  these  approaches  can  be  practi¬ 
cally  implemented.  (Refs  1,  7,  and  9) .  In  fact,  with  the 
possible  exception  of  the  algorithm  proposed  by  Kashyap  (Ref 
9) ,  there  are  currently  no  algorithms  which  are  capable  of 
making  decisions  in  a  practical  manner  when  there  are  more 
than  a  few  errors  in  the  data  being  examined. 

Scope 

The  recognition  algorithm  that  was  developed  in  this 
thesis  is  capable  of  recognizing  isolated  words,  or  phrases 
that  are  treated  as  words.  The  algorithm  is  completely  inde¬ 
pendent  of  the  vocabulary  in  the  sense  that  words  or  phrases 
can  be  added  or  deleted  at  any  time.  However,  the  accuracy 
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of  the  system  will  be  effected  by  the  vocabulary  chosen  since 
some  words  are  more  likely  to  be  confused  with  one  another 
than  with  others.  The  algorithm  can  be  used  as  either  a 
speaker  dependent  or  independent  system.  As  with  most  other 
recognition  systems,  the  accuracy  of  the  system  is  higher 
when  using  the  system  as  a  speaker  dependent  system  since 
the  output  of  the  acoustic  analyzer  is  more  consistent  for 
one  speaker.  The  only  information  that  is  used  in  determin¬ 
ing  the  words  spoken  is  obtained  from  the  acoustic  output. 

It  should  also  be  noted  that  no  attempt  was  made  to  improve 
Seelandt's  algorithm. 

General  Approach 

Because  of  the  high  error  rate  of  the  acoustic  processor, 
it  was  necessary  to  rely  on  the  consistency  of  the  sequences 
of  phonemes,  and  the  errors  that  typically  occur  for  a  word 
to  determine  the  word  spoken.  This  was  accomplished  by  gen¬ 
erating  error  statistics  and  phoneme  representations  for  each 
word  in  the  vocabulary  using  a  set  of  training  speech  files. 
The  training  files  were  created  by  applying  the  acoustic 
analyzer  to  a  number  of  speech  samples  for  each  word. 

The  top  five  phoneme  choices  provided  by  the  acoustic 
processor,  and  information  indicating  the  accuracy  of  each 
choice,  for  each  time  segment  of  speech  was  implemented  in 
the  decision  algorithm.  Fuzzy  set  theory  was  used  to  combine 
this  information  with  the  error  information  obtained  from 
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the  training  files  to  determine  the  word  spoken.  This  theory 
will  be  presented  in  Chapter  3. 

Because  of  the  number  and  types  of  errors  present  in  the 
acoustic  output,  it  was  necessary  to  develop  a  transition 
algorithm  capable  of  producing  a  word  score,  indicating  the 
plausibility  of  a  word  being  the  one  spoken,  regardless  of 
the  errors  that  occurred.  Using  this  algorithm,  a  score  is 
obtained  for  each  word  in  the  vocabulary,  and  the  word  with 
the  highest  score  is  chosen  as  the  word  that  was  spoken. 


Sequence  of  Presentation 


The  next  chapter  of  this  thesis  examines  the  format  and 
accuracy  of  the  acoustic  analyzer  developed  by  Seelandt.  It 
is  very  important  that  the  reader  study  this  chapter  care¬ 
fully  to  obtain  an  understanding  of  the  significance  of  the 
proposed  word  recognition  algorithm.  Chapter  3  presents  the 
basic  concepts  of  fuzzy  set  theory  that  are  employed  in  the 
recognition  algorithm  that  will  be  described  in  Chapter  4. 
Chapters  5  and  6  present  possible  algorithms  for  determining 
a  word's  phoneme  representation  and  optimizing  program  para¬ 
meters,  respectively.  Preliminary  results  are  given  in  Chap¬ 
ter  7,  and  general  remarks  and  conclusions  are  given  in  Chap¬ 
ter  8.  Chapter  9  provides  a  detailed  list  of  recommendations. 
The  appendixes  contain  a  recommended  approach  for  designing 
a  continuous  speech  understanding  system,  and  program  docu¬ 


mentation. 


II .  Acoustic  Output 


The  word  recognition  algorithm  developed  in  this  thesis 
relies  solely  on  the  output  of  the  acoustic  processor  devel¬ 
oped  by  Karl  Seelandt  (Ref  15) .  The  purpose  of  this  chapter 
is  to  examine  the  format  and  accuracy  of  this  output. 

Basically,  the  acoustic  processor  attempts  to  identify 
the  phonemes  that  occur  in  speech.  As  defined  in  Chapter  1, 
a  phoneme  is  a  basic  unit  of  speech,  combinations  of  which 
can  be  used  to  form  words.  The  set  of  phonemes  used  in  this 
thesis,  created  from  Karl  Seelandt' s  speech,  is  shown  in 
Table  I,  and  consists  of  71  distinct  segments  of  speech  that 
occur  in  the  words  "zero"  through  "nine".  However,  the  acous¬ 
tic  processor,  and  word  recognition  algorithm  to  be  presented, 
can  be  used  with  any  predefined  set  of  phonemes. 

Output  Format 

Figure  2  illustrates  the  acoustic  output  for  the  word 
"zero"  spoken  by  GJM.  The  first  column  of  the  output  pro¬ 
vides  the  vector  number  assigned  to  each  segment  of  speech. 

The  reason  that  the  first  vector  number  is  not  one  is  that 
the  noise  occurring  before  and  after  the  actual  speech  has 
been  removed.  The  next  five  columns  of  the  output  provide 
the  top  five  phoneme  choices,  and  their  respective  weights 
for  each  segment.  The  phoneme  chosen  is  given  by  the  first 
number  of  each  pair,  and  corresponds  to  the  phoneme  numbers 
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Sound 

From 

Phoneme 

Svmbol 

Word 

Number 

XX 

noise 

1 

Z1 

zero 

2 

Z2 

zero 

3 

Z3 

zero 

4 

ER1 

zero 

f> 

ER2 

zero 

6 

RA 

zero 

7 

UH 

zero 

8 

OU1 

zero 

9 

OH1 

zero 

10 

OH2 

zero 

11 

OU2 

one 

12 

WU1 

one 

13 

WU2 

one 

14 

AW 

one 

15 

OY1 

one 

16 

OY2 

one 

17 

NX1 

one 

18 

T1 

two 

19 

T2 

two 

20 

T3 

two 

21 

OU3 

two 

22 

OU4 

two 

23 

UA 

two 

24 

TH 

three 

25 

RX1 

three 

26 

REl 

three 

27 

RE  2 

three 

28 

TABLE  I  (cont'd) 


Sound 

Symbol 

From 

Word 

Phoneme 

Number 

EE1 

three 

29 

EE  2 

three 

30 

FXl 

four 

31 

FX2 

four 

32 

OH  3 

four 

33 

OR 

four 

34 

RX2 

four 

35 

PX3 

five 

36 

FX4 

five 

37 

WA 

five 

38 

AHl 

five 

39 

AH  2 

five 

40 

AY1 

five 

41 

AY  2 

five 

42 

VX1 

five 

43 

SX1 

six 

44 

SX2 

six 

45 

SX3 

six 

46 

IY1 

six 

47 

IY2 

six 

48 

KXl 

six 

49 

KX2 

six 

50 

SX4 

seven 

51 

EH1 

seven 

52 

EH  2 

seven 

53 

VX2 

seven 

54 

EH  3 

seven 

55 

NX2 

seven 

56 

Sound 

Symbol 

From 

Word 

Phoneme 

Number 

AE1 

eight 

57 

AE2 

eight 

58 

AE3 

eight 

59 

EE  3 

eight 

60 

T4 

eight 

61 

T5 

eight 

62 

NX3 

nine 

63 

AH  3 

nine 

64 

AH  4 

nine 

65 

AH  5 

nine 

66 

AH6 

nine 

67 

IE1 

nine 

68 

IE2 

nine 

69 

NX  4 

nine 

70 

XX 

noise 

71 

ZERO 

CJH 

THE 

DATE 

IS- 

-  8 

20 

1982 

THE 

TIME 

IS- 

-  20 

19 

24 

VECTOR 

FIRST 

SECOND 

THIRD 

FOURTH 

FIFTH 

SCALE 

NUnBER 

CHOICE 

CHOICE 

CHOICE 

CHOICE 

CHOICE 

FACTOR 

****** 

****** 

****** 

****** 

****** 

****** 

****** 

95 

21 

100 

63 

95 

56 

94 

70 

93 

43 

88 

*9131.530 

96 

21 

100 

43 

89 

63 

88 

70 

87 

32 

84 

,79304210 

97 

21 

100 

32 

94 

70 

93 

62 

92 

56 

90 

.86398080 

96 

70 

100 

21 

99 

62 

96 

56 

93 

43 

91 

,90292700 

99 

70 

100 

56 

92 

21 

91 

43 

86 

63 

87 

.85169500 

100 

70 

100 

56 

91 

63 

88 

62 

87 

21 

63 

.81483860 

101 

70 

100 

63 

87 

12 

82 

56 

82 

62 

81 

.77515350 

102 

70 

100 

63 

85 

12 

82 

62 

80 

21 

77 

.76327380 

103 

70 

100 

12 

81 

63 

81 

23 

78 

24 

78 

.75937090 

104 

70 

100 

12 

81 

13 

80 

23 

80 

24 

80 

.78226070 

105 

70 

100 

13 

81 

24 

78 

12 

77 

21 

/  6 

.79373070 

106 

70 
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24 

75 

21 

74 
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.83285790 
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70 
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28 

78 

13 

76 

43 

74 

63 

73 

.87437260 

10ft 
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28 

82 

43 

75 

13 

72 

21 

72 

.88921780 

109 

70 

100 

28 
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43 

78 
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73 

21 

69 

.89245720 

110 

70 
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77 

43 

75 

63 
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13 

66 
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76 

43 

74 
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80 
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77 

24 

76 

43 
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.75072310 

113 

70 
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84 
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81 

54 

81 

24 

80 

.68498410 
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86 
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85 
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83 
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91 
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88 
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93 
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92 
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90 
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89 
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85 

.85203830 

119 
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23 

94 

33 

93 

11 

92 
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90 

.94516810 

120 

34 
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33 

98 

23 

96 

12 

95 
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94 
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34 
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92 
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Fig.  2.  Acoustic  Output  for  Word  "Zero" 
Spoken  by  GJM 
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shown  in  Table  I.  The  weights  assigned  to  each  phoneme  have 
been  scaled  from  0  to  100  for  each  vector,  with  100  being 
assigned  to  the  best  choice.  The  last  column  indicates  the 
scale  factor  assigned  to  each  vector,  and  provides  a  measure 
of  the  accuracy  of  one  vector  compared  to  the  accuracy  of  the 
other  vectors  in  the  speech  file.  This  factor  has  values 
between  0  and  1,  with  1  corresponding  to  the  least  accurate 
vector. 

The  weights  and  scale  factors  given  in  the  acoustic  out¬ 
put  are  derived  based  on  the  distances  between  each  prototype 
phoneme  and  each  segment  of  speech.  The  set  of  distances, 

Di#  computed  for  each  vector  i,  can  be  represented  by  the  set 


Di  =  <dl,i' 


l2,i< 


*c,  i' 


*n,i> 


(1) 


with  dc  ^  the  distance  between  the  cth  prototype  phoneme  and 
vector  i,  and  n  the  number  of  prototypes.  The  equations  used 
to  compute  the  weights  for  the  pnoneme  choice,  X_  and  the 
scale  factor,  SF^  for  vector  i  are: 

xc,i  -  100  x  ((Max{Di>  -  dc  ^/(Max  {D± } 

-  Min  {D± >) )  (2) 

SFX  =  (Min{Di>)/Max  {Min^},  Min{D2}, 

Min{D3},  ...,  Min{Dt}}  (3) 


where  t  is  the  number  of  vectors  in  the  speech  file,  and  X  . 
is  the  weight  assigned  to  phoneme  PQ.  Note  that 
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Max^} 

Min{Di> 


=  Max{d 
=  Min{d 


1/  i' 

*2.1' 

-  dc,i' 

. ..,  d  . } 
n,  x 

(4) 

l,i' 

*2,1' 

-  dc,i' 

•  •  • ,  d  . ) 
n,  x 

(5) 

The  logic  for  scaling  the  phoneme  distances  between  0 
and  100,  and  for  providing  the  scale  factor  for  each  vector, 
rather  than  using  the  actual  distances,  is  to  make  the  weights 
for  the  phoneme  choices  for  a  given  vector  independent  of  the 
weights  assigned  to  the  phonemes  chosen  for  other  vectors. 

By  making  these  weights  independent  for  each  vector,  the  im¬ 
portance  of  the  value  of  the  minimum  distance  for  each  vector 
can  be  varied.  The  manner  in  which  this  is  done  will  be  dis¬ 
cussed  in  the  description  of  the  word  recognition  algorithm. 


Output  Accuracy 

To  provide  an  understanding  of  the  proposed  word  recog¬ 
nition  algorithm,  and  the  significance  of  this  algorithm,  it 
is  necessary  to  examine  the  accuracy  of  the  acoustic  output. 
Referring  to  the  phoneme  representation  for  word  "zero"  given 
in  Table  I  (phonemes  2  through  11) ,  and  to  the  sample  outputs 
for  the  word  "zero"  spoken  by  five  different  speakers,  shown 
in  Figures  3  through  7;  it  is  easily  seen  that  the  accuracy 
of  the  acoustic  processor  is  extremely  poor,  and  that  often 
the  correct  phonemes  are  not  even  in  the  output.  In  fact, 
the  accuracy  is  such  that  any  meaningful  error  percentage 
would  be  virtually  impossible  to  derive.  To  overcome  this 


13 


ZERO 


THE 

DATE 

IS¬ 

-  5 

25 

1982 

THE 

TIME 

IS- 

-  19 

1 

56 

VECTOR 

FIRST 

SECOND 

THIRD 

FOURTH 

FIFTH 

SCALE 

NUMBER 

CHOICE 

CHOICE 

CHOICE 

CHOICE 

CHOICE 

FACTOR 

****** 

****** 

****** 

****** 

****** 

****** 

****** 

90 

62 

100 

71 

93 

21 

89 

20 

85 

01 

61 

.81760670 

91 

02 

100 

21 

95 

62 

92 

20 

89 

03 

87 

.69882920 

92 

02 

100 

03 

84 

46 

83 

20 

75 

31 

74 

.81740700 

93 

19 

100 

02 

99 

03 

97 

20 

94 

44 

90 

.94243760 

94 

20 

100 

19 

89 

61 

85 

02 

84 

31 

82  . 

.82714840 

95 

20 

100 

03 

87 

44 

86 

46 

84 

02 

83 

.90076300 

9* 

20 

100 

61 

85 

02 

63 

03 

61 

30 

81 

.84040860 

97 

20 

100 

02 

99 

30 

93 

46 

93 

61 

93 

.96326630 

98 

20 

100 

46 

98 

02 

97 

31 

97 

21 

96 

.97433140 

99 

29 

100 

30 

97 

59 

96 

20 

94 

46 

94 

1.00000000 

100 

30 

100 

20 

93 

46 

92 

21 

89 

25 

87 

.96126880 

101 

30 

100 

21 

90 

60 

90 

46 

88 

70 

87 

.99318220 

102 

30 

100 

29 

90 

46 

88 

70 

86 

21 

86 

.97390340 

103 

29 

100 

30 

95 

46 

95 

69 

91 

60 

90 

.98903580 

104 

46 

100 

04 

96 

69 

96 

28 

95 

29 

95 

.97576470 

105 

69 

100 

04 

98 

22 

97 

46 

93 

28 

92 

.90866070 

106 

22 

100 

69 

98 

70 

94 

04 

92 

32 

90 

.80807590 

107 

70 

100 

22 

99 

69 

93 

32 

91 

04 

86 

.75795470 

108 

70 

100 

48 

97 

22 

96 

69 

87 

23 

83 

.75848590 

109 

70 

100 

23 

89 

63 

87 

48 

84 

22 

82 

.75575530 

110 

70 

100 

23 

89 

63 

87 

48 

82 

12 

78 

.78379240 

111 

70 

100 

27 

90 

23 

68 

63 

85 

06 

64 

.80849080 

112 

70 

100 

27 

97 

23 

90 

63 

88 

06 

87 

.81929250 

113 

70 

100 

27 

97 

23 

94 

63 

94 

34 

84 

.79022160 

114 

70 

100 

23 

91 

63 

90 

27 

87 

12 

83 

.72270700 

115 

70 

100 

23 

96 

63 

90 

07 

88 

12 

88 

.73743130 

•  ’6 

07 

100 

70 

99 

23 

98 

12 

93 

63 

90 

.79020180 

i  1 7 

07 

100 

12 

96 

23 

96 

70 

96 

63 

89 

.84055130 

118 

23 

100 

07 

98 

12 

98 

34 

96 

05 

94 

.89116410 

119 

34 
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23 

98 

12 

95 

05 

93 

07 

91 

.85788100 

120 

12 
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23 

99 

52 

97 

34 

93 

05 
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.89522270 

121 

52 
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55 

98 

23 

96 
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95 

08 

93 
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122 
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23 

98 
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97 

08 

96 

07 

95 

.88056220 

123 
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07 

97 

12 

96 

52 

94 

08 

92 

.84726150 

124 

23 

100 

12 

96 

07 

90 

08 

90 

09 

90 

.84974180 

125 

23 
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08 

97 

09 

94 

07 

91 

12 

90 

.83060780 

126 

08 
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09 

97 

23 

94 

07 

91 

12 

87 
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127 

23 
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08 

96 

07 

93 

12 

91 

34 

90 

.75337810 
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23 

100 

12 

95 

54 

93 

08 

89 

34 

89 

.79251320 

129 

08 

100 

35 

98 

38 

95 

07 

94 

09 

94 

.75838940 

130 

08 

100 

07 

95 

09 

94 

34 

90 

06 

89 

.68710600 

131 

12 

100 

07 

99 

06 

97 

33 

97 

24 

96 

.74905610 

132 

12 
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11 

99 

33 

95 

24 

92 

34 

92 

.77477740 

133 

11 

100 

33 

98 

10 

97 

12 

96 

24 

96 

.78694440 

134 

11 

100 

12 

99 

10 

97 

33 

97 

24 

92 

.76734290 

135 

12 

100 

33 

90 

11 

85 

24 

85 

62 

84 

,66368070 

136 

12 

100 

71 

93 

62 

90 

33 

88 

11 

87 

.71807770 

137 

62 

100 

12 

97 

71 

68 

33 

87 

11 

86 
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138 

62 
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12 

94 

71 

90 

56 

84 

70 

82 

.59088270 

139 
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97 

12 

94 

21 

86 

01 

83 

,64932520 

Fig.  3.  Acoustic  Output  for  Word  "Zero" 
Spoken  by  DCD 
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Fig.  4.  Acoustic  Output  for  Word  "Zero" 
Spoken  by  JEF 
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128 

39 

100 

41 

99 

07 

97 

10 

96 

52 

91 

.93706050 

129 

41 

100 

39 

99 

07 

95 

52 

94 

35 

93 

.93934140 

130 

41 

100 

39 

94 

52 

92 

07 

90 

10 

87 

.89274660 

131 

41 

100 

52 

98 

39 

97 

35 

91 

07 

90 

.93388740 

132 

41 

100 

52 

99 

39 

93 

07 

88 

35 

67 

.90551440 

133 

52 

100 

41 

95 

39 

92 

35 

65 

0/ 

84 

.85531030 

134 

52 

100 

41 

95 

39 

89 

35 

84 

07 

83 

.82273390 

135 

52 

100 

41 

91 

39 

82 

53 

80 

07 

79 

.76898570 

136 

52 

100 

41 

85 

39 

83 

53 

83 

07 

78 

.72995780 

137 

52 

100 

41 

83 

53 

83 

39 

60 

07 

76 

.68760880 

136 

52 

100 

53 

68 

41 

63 

39 

80 

07 

76 

,70787350 

139 

52 

100 

53 

91 

39 

86 

41 

82 

07 

79 

.75222250 

140 

52 

100 

53 

95 

39 

89 

41 

84 

09 

83 

.75002690 

141 

52 

100 

53 

97 

39 

92 

09 

90 

08 

89 

.83117820 

142 

39 

100 

52 

97 

08 

96 

07 

95 

53 

95 

.69778450 

143 

39 

100 

07 

96 

08 

96 

09 

95 

35 

95 

.83792300 

144 

39 

100 

07 

99 

08 

99 

09 

98 

33 

95 

,89643600 

145 

07 

100 

33 

97 

35 

9/ 

26 

92 

39 

92 

.85804230 

146 

07 

100 

35 

97 

33 

94 

09 

91 

11 

91 

.80053440 

147 

07 

100 

12 

95 

33 

94 

09 

93 

08 

92 

.85925790 

148 

07 

100 

33 

99 

12 

98 

11 

93 

26 

90 

.88728750 

149 

33 

100 

01 

97 

12 

97 

62 

97 

07 

96 

.90737920 

Fig.  5.  Acoustic  Output  for  Word  "Zero" 
Spoken  by  LMR 
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ZERO 

HJX 


THE  DATE  IS--  5  20  1982 
THE  TINE  IS--  2  54  39 


VECTOR 

FIRST 

SECOND 

THIRD 

FOURTH 

FIFTH 

SCALE 

NUMBER 

CHOICE 

CHOICE 

CHOICE 

CHOICE 

CHOICE 

FACTOR 

«**### 

*«»««* 

****** 

****** 

****** 

*«***« 

***«*» 

98 

03 

100 

01 

93 

71 

64 

02 

82 

62 

66 

.82128940 

99 

02 

100 

01 

89 

03 

64 

71 

79 

20 

72 

.81486480 

100 

01 

100 

02 

75 

19 

71 

03 

70 

71 

69 

.69117470 

101 

01 

100 

19 

61 

03 

75 

71 

67 

02 

64  . 

.69976470 

102 

01 

100 

03 

74 

19 

71 

71 

68 

02 

52 

.65773640 

103 

01 

100 

03 

88 

71 

66 

02 

65 

19 

65 

.63326690 

104 

03 

100 

01 

97 

02 

82 

19 

79 

71 

65 

.78161590 

105 

03 

100 

19 

93 

02 

91 

01 

83 

20 

74 

.83109990 

106 

03 

100 

19 

99 

20 

/5 

02 

73 

29 

65 

.73087350 

107 

03 

100 

19 

89 

20 

81 

29 

66 

59 

64 

.71808520 

108 

03 

100 

20 

93 

19 

75 

29 

71 

59 

66 

.78974500 

109 

20 

100 

03 

89 

44 

61 

30 

80 

61 

79 

.91133280 

110 

30 

100 

46 

99 

21 

93 

45 

92 

20 

91 

.99011790 

111 

70 

100 

04 

99 

30 

99 

45 

98 

21 

95 

1,00000000 

112 

04 

100 

70 

97 

28 

91 

22 

87 

30 

83 

.94437750 

113 

04 

100 

70 

98 

26 

91 

22 

87 

13 

81 

.93398200 

114 

70 

100 

04 

94 

28 

91 

22 

89 

48 

87 

.862835/0 

115 

70 

100 

48 

95 

28 

93 

22 

89 

04 

87 

.78195260 

116 

70 

100 

48 

97 

22 

93 

28 

90 

23 

84 

.74313770 

117 

70 

100 

48 

88 

22 

87 

23 

85 

28 

61 

.71269390 

118 

70 

100 

27 

89 

48 

89 

23 

86 

63 

82 

,73021480 

119 

70 

100 

27 

99 

23 

94 

06 

89 

48 

67 

.77017440 

120 

27 

100 

23 

94 

70 

93 

07 

92 

06 

89 

.75030090 

121 

27 

100 

23 

94 

07 

92 

70 

90 

06 

88 

.71738240 

122 

27 

100 

23 

95 

70 

91 

07 

89 

33 

68 

,68471430 

123 

23 

100 

27 

95 

33 

91 

70 

90 

07 

69 

.67089920 

124 

23 

100 

33 

92 

0/ 

68 

26 

87 

27 

87 

.63038190 

125 

23 

100 

33 

96 

07 

92 

26 

89 

06 

68 

.63559440 

126 

33 

100 

23 

96 

07 

95 

12 

90 

34 

90 

.64036700 

127 

33 

100 

07 

99 

34 

92 

23 

91 

06 

89 

.64739980 

128 

07 

100 

33 

95 

34 

92 

27 

91 

06 

87 

,63049130 

129 

07 

100 

34 

94 

27 

93 

33 

93 

35 

88 

.65319100 

130 

07 

100 

34 

96 

27 

94 

33 

93 

35 

93 

,69736360 

131 

07 

100 

34 

95 

33 

93 

35 

93 

08 

92 

,71064220 

132 

07 

100 

08 

98 

09 

98 

34 

96 

10 

93 

.72598510 

133 

08 

100 

09 

99 

07 

97 

10 

93 

34 

92 

.69765400 

134 

09 

100 

08 

98 

07 

96 

10 

93 

11 

91 

,69873360 

135 

09 

100 

06 

99 

10 

97 

07 

95 

11 

95 

,73280530 

136 

10 

100 

08 

99 

11 

98 

09 

97 

33 

97 

,73978330 

137 

12 

100 

08 

98 

33 

98 

11 

97 

09 

95 

,70654710 

138 

12 

100 

11 

96 

33 

94 

13 

89 

09 

8/ 

,61810920 

139 

12 

100 

11 

92 

33 

91 

13 

89 

09 

86 

,58621560 

140 

12 

100 

11 

92 

33 

92 

08 

91 

09 

91 

,65384550 

141 

12 

100 

11 

92 

07 

91 

23 

91 

33 

91 

.66156430 

142 

12 

100 

11 

90 

23 

90 

33 

90 

10 

88 

,64918850 

143 

12 

100 

23 

98 

11 

96 

10 

94 

33 

93 

,68718060 

144 

23 

100 

12 

97 

11 

94 

24 

91 

07 

88 

,69230060 

145 

12 

100 

11 

94 

23 

92 

33 

92 

34 

88 

.68498370 

146 

12 

100 

62 

97 

33 

96 

11 

95 

01 

90 

,74130050 

Fig.  6.  Acoustic  Output  for  Word  "Zero" 
Spoken  by  MJK 
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ZERO 


THE 

DATE 

IS¬ 

-  5 

29 

1982 

THE 

TIME 

IS- 

-  14 

19 

23 

VECTOR 

FIRST 

SECOND 

THIRD 

FOURTH 

FIFTH 

SCALE 

NUMBER 

CHOICE 

CHOICE 

CHOICE 

CHOICE 

CHOICE 

FACTOR 

****** 

****** 

****** 

****** 

****** 

****** 

****** 

ao 

03 

100 

19 

92 

02 

89 

01 

83 

20 

68 

.83642510 

ai 

03 

100 

19 

95 

20 

83 

02 

78 

01 

67 

.87908170 

82 

03 

100 

20 

92 

19 

63 

44 

76 

29 

74 

.96659320 

83 

20 

100 

61 

92 

25 

88 

51 

84 

46 

78 

.98333780 

84 

32 

100 

20 

97 

21 

90 

18 

86 

46 

85 

,96238590 

85 

70 

100 

21 

93 

63 

90 

04 

87 

18 

87 

.85514750 

86 

70 

100 

21 

94 

04 

89 

23 

89 

16 

86 

.87223290 

87 

21 

100 

70 

93 

28 

92 

04 

66 

18 

86 

.90579780 

88 

28 

100 

22 

97 

60 

93 

21 

91 

70 

91 

.96419890 

89 

60 

100 

22 

99 

30 

92 

70 

90 

16 

89 

.95117830 

90 

22 

100 

60 

95 

30 

94 

28 

91 

69 

88 

.93124870 

91 

69 

100 

22 

90 

30 

94 

28 

93 

70 

93 

1.00000000 

92 

70 

100 

28 

97 

69 

91 

22 

87 

48 

86 

.944997/0 

93 

70 

100 

48 

86 

04 

84 

63 

84 

28 

83 

.82626210 

94 

70 

100 

63 

86 

48 

82 

04 

79 

22 

77 

.79448350 

95 

70 

100 

63 

92 

48 

91 

43 

60 

04 

79 

.79253020 

96 

70 

100 

63 

96 

48 

92 

05 

65 

23 

85 

.81578730 

97 

63 

100 

70 

99 

23 

98 

54 

94 

26 

92 

.84678410 

98 

63 

100 

23 

99 

26 

97 

70 

97 

27 

94 

.80866450 

99 

26 

100 

23 

99 

06 

96 

70 

96 

54 

94 

.77236000 

100 

26 

100 

07 

97 

54 

97 

23 

96 

06 

95 

.73679290 

101 

07 

100 

06 

98 

26 

98 

27 

96 

23 

94 

.72435360 

102 

06 

100 

26 

95 

07 

94 

27 

94 

35 

88 

.69644150 

103 

27 

100 

06 

98 

35 

98 

26 

95 

07 

90 

.73300430 

104 

35 

100 

27 

95 

06 

91 

07 

66 

26 

86 

.67980830 

105 

35 

100 

27 

97 

07 

95 

06 

93 

34 

66 

.74359270 

106 

07 

100 

35 

96 

27 

92 

55 

92 

06 

91 

.73468580 

107 

07 

100 

35 

92 

27 

89 

26 

87 

06 

66 

.71449790 

108 

07 

100 

35 

96 

27 

91 

34 

90 

26 

88 

.75898530 

109 

35 

100 

34 

97 

07 

96 

27 

90 

06 

88 

.75825920 

110 

35 

100 

34 

96 

06 

91 

07 

90 

39 

90 

.71297000 

111 

35 

100 

34 

99 

07 

95 

08 

95 

39 

95 

.79819630 

112 

07 

100 

35 

94 

39 

94 

08 

93 

54 

93 

,76658480 

113 

07 

100 

08 

92 

23 

92 

26 

92 

09 

90 

.72521780 

114 

07 

100 

08 

97 

23 

93 

09 

91 

34 

91 

.75838840 

115 

08 

100 

34 

97 

07 

96 

23 

94 

12 

93 

.77299700 

116 

08 

100 

12 

99 

34 

97 

11 

94 

09 

93 

,75455530 

117 

12 

100 

11 

89 

34 

89 

08 

88 

09 

85 

.63196660 

118 

12 

100 

54 

86 

11 

82 

24 

80 

33 

80 

.54840840 

119 

12 

100 

54 

83 

11 

79 

24 

78 

33 

78 

.46845190 

120 

12 

100 

62 

66 

54 

81 

70 

79 

11 

78 

.47375940 

121 

12 

100 

62 

93 

70 

84 

21 

83 

54 

82 

.53143450 

122 

12 

100 

62 

98 

70 

87 

21 

84 

54 

80 

.58056580 

123 

62 

100 

12 

96 

21 

89 

01 

84 

71 

84 

.63793120 

124 

62 

100 

01 

99 

71 

90 

12 

86 

21 

76 

.63102670 

Fig.  7.  Acoustic  Output  for  Word  "Zero" 
Spoken  by  RLC 
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error  rate  it  was  necessary  to  rely  on  the  consistency  of 
the  errors  that  are  made,  and  the  sequences  of  phonemes  that 
are  output  for  each  word.  Obviously,  this  consistency  does 
not  exist  for  the  first  phoneme  choices  for  a  word.  Because 
of  this  fact,  more  choices  were  examined,  and  it  was  found 
that  although  not  obvious,  consistencies  do  appear  to  exist 
in  the  top  choices  of  the  output.  Although  5  choices  were 
used  in  this  thesis,  the  word  recognition  algorithm  can  eas¬ 
ily  be  altered  to  consider  any  number  of  choices. 

At  present,  there  are  no  algorithms  known  to  the  author 
which  can  account  for  the  number  of  errors  present  in  this 
output,  or  that  can  identify  the  non-obvious  consistencies 
that  appear  to  exist.  Also,  there  are  no  known  algorithms 
currently  available  which  can  efficiently  use  the  information 
provided  in  a  number  of  alternate  choices  for  each  segment 
of  speech.  Therefore  it  was  necessary  to  develop  a  unique 
algorithm  which  could: 

1.  account  for  a  large  number  of  errors  in  the  output; 

2.  determine  the  consistencies  that  occur  in  the  out¬ 
put  to  recognize  the  word  spoken;  and 

3.  efficiently  use  the  information  provided  by  a  num¬ 
ber  of  alternative  choices  for  a  given  vector. 

These  objectives  were  accomplished  using  fuzzy  set 
theory,  which  will  be  presented  in  the  following  chapter. 
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Ill .  Fuzzy  Set  Theory 


Introduction 

The  concept  of  Fuzzy  Set  Theory  was  first  introduced  by 
Lofti  A.  Zadeh  in  1965  (Ref  19) .  Since  then  numerous  arti¬ 
cles  have  been  published  concerning  the  algebraic  properties, 
and  possible  applications  of  this  theory.  However,  the  suc¬ 
cesses  of  fuzzy  set  theory  have  been  extremely  limited.  Per¬ 
haps  the  major  reason  that  accounts  for  the  limited  success 
of  this  relatively  new  theory,  is  that  the  applications  for 
which  it  appears  to  be  suited  deal  with  problems  where  there 
have  only  been  limited  successes.  For  example  the  areas  of 
pattern  recognition  and  Artificial  Intelligence  appear  to  be 
ideal  fields  for  the  application  of  this  theory,  and  at  pres¬ 
ent  there  are  no  methods  which  appear  to  be  superior  to  any 
other  in  either  of  these  areas. 

Fuzzy  Set  Theory 

Fuzzy  set  theory,  as  the  name  implies,  provides  a  theory 
for  sets  in  which  the  transition  from  membership  to  non-mem¬ 
bership  is  not  clearly  defined  (fuzzy)  (Ref  20) .  For  example, 
the  set  of  people  who  are  young  is  not  clearly  defined,  and 
the  transition  from  membership  to  non-membership  can  not  be 
clearly  established.  To  account  for  this  fuzziness,  the  mem¬ 
bers  of  a  fuzzy  set  are  assigned  grades  of  membership  in  the 
interval  from  zero  to  one.  A  fuzzy  set,  S,  will  then  be  of 
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the  form: 


S  =  {xj/ufxj),  x2/u(x2),  xi/u(xi) 


Vu(xn)  > 


(6) 


where  x^  is  an  element  of  set  S  with  a  grade  of  membership 
of  u(xi)  in  the  set.  Continuing  the  example  of  the  set  con¬ 
sisting  of  young  people,  one  possible  fuzzy  set  could  be 


S (young)  =  {10/1.0,  25/0.9,  35/0.8,  50/0.5,  75/0.01}  (7) 

where  the  ages,  or  elements,  of  the  set  are  10,  25,  35,  50, 
and  75;  and  their  respective  grades  of  membership  are  1.0, 
0.9,  0.8,  0.5,  and  0.01. 

The  grades  of  membership  can  be  assigned  either  subjec¬ 
tively,  as  the  ones  shown  above,  or  may  be  assigned  by  using 
a  mathematical  equation.  An  example  of  an  intuitive  equation 
that  could  be  used  to  determine  the  grades  of  membership  of 
the  elements  in  the  set  of  young  people,  taken  from  Ref  17, 
is  shown  below. 


u^)  =  (1,  for  xA  <  =  25, 

=  {{1  +  {(xi-25)/5) 2)_1,  for  x±  >  25  (8) 

where  u(x^)  is  the  grade  of  membership  of  the  element  x^  in 
the  set  S (young). 

Algebraic  Properties 

From  the  discussion  presented  thus  far,  it  may  appear 
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that  fuzzy  set  theory  is  an  extremely  informal  mathematical 
tool?  however,  since  the  introduction  of  this  theory,  there 
have  been  numerous  papers  written  which  have  attempted  to 
provide  a  more  formal  theory.  Some  of  the  formal  algebraic 
properties  that  have  been  suggested  include: 

Equality:  Two  fuzzy  sets,  A  and  B,  are  equal  if  and 

only  if  uA(x)  =  uB(x)  for  all  x,  where  x  is  an  element 
of  the  set  of  all  elements. 

Containment:  A  fuzzy  set  A  is  contained  in,  or  is  a 
subset  of  a  fuzzy  set  B,  written  A  <  B,  if  and  only  if 
uA (x)  <=  uB(x)  for  all  x. 

Complementation:  A'  is  the  complement  of  the  fuzzy  set 
A  if  and  only  if  uA,  (x)  =  1  -  uA(x),  for  all  x. 

Intersection:  The  intersection  of  the  fuzzy  sets  A  and 
B,  denoted  by  A  A  B,  is  given  by 

VB<X>  =  14111  (uaM'  «b<x>>  for  a11  x  (9) 

and  is  defined  as  the  largest  fuzzy  set  contained  in 
both  A  and  B. 

Union:  The  union  of  the  fuzzy  sets  A  and  B,  denoted 

A  U  B,  is  given  by 

u*ttd*x)  =  Max  (u.(x),  u  (x))  for  all  x  (10) 

AUB  A  a 

and  is  defined  as  the  smallest  fuzzy  set  containing 


both  A  and  B. 


An  extensive  examination  of  these  properties  and  others  that 
have  been  proposed,  can  be  found  in  the  references  (Refs  8 
and  19) . 

Although  formalization  is  generally  useful,  strict  adher- 
ance  to  the  formal  theory  that  some  have  proposed,  may  signi¬ 
ficantly  diminish  its  advantages  and  possible  applications. 

For  instance,  the  operators  "Min"  and  "Max"  are  used  exten- 
sively  for  the  sake  of  formality.  Although  these  operators 
are  definitely  of  value,  the  author  shares  the  view  of  Dubois 
and  Prade  who  state 

"...  the  choice  of  an  operator  is  always  a  matter  of 
context,  (and)  mainly  depends  upon  the  real  world  situation 
which  is  to  be  modeled.  In  other  words,  all  mathematical 
properties  regarding  the  class  of  fuzzy  set  theoretic  oper¬ 
ators  must  be  interpreted  at  an  intuitive  level."  (Ref  2). 

Therefore,  any  operator  which  appears  reasonable,  such  as  the 
"average"  or  "product"  operators  defined  below,  should  also 
be  considered  when  developing  a  fuzzy  algorithm  (i.e.  an 
algorithm  employing  fuzzy  set  theory) • 

Average ;  The  average  of  two  fuzzy  sets  A  and  B,  to  pro¬ 
duce  fuzzy  set  C,  is  given  by 

uc(x)  =  (uA(x)  +  uB(x))/2  for  all  x  (11) 

Product;  The  product  of  two  fuzzy  sets  A  and  B,  to  pro¬ 
duce  fuzzy  set  C,  is  given  by 

uc(x)  =  UA(X)  x  u  (x)  f°r  x 


(12) 


Decision  Making  Using  Fuzzy  Sets 


An  important  application  of  fuzzy  set  theory  is  in  mak¬ 
ing  decisions  when  a  number  of  factors  must  be  considered. 
Ronald  R.  Yager  has  written  two  excellent  articles  describing 
the  application  of  fuzzy  set  theory  to  multiple  objective 
decision-making  (Refs  17  and  18) .  The  decision  scheme  that 
follows  is  similar  to  the  one  proposed  by  Yager,  with  a  few 
notable  exceptions. 

Perhaps  the  best  way  to  present  the  proposed  decision 
scheme  is  by  means  of  an  example.  Assume  that  a  businessman 
must  decide  what  job  to  take  given  a  choice  of  4  jobs  located 
in  the  following  cities: 

A.  Anchorage,  Alaska; 

B.  Boston,  Massachusetts; 

C.  Chicago,  Illinois; 

D.  Denver,  Colorado. 

He  decides  that  he  will  base  his  decision  on  the  following 
three  factors: 

PI:  salary  vs  cost  of  living; 

F2:  advancement  opportunities;  and 

F3:  his  wife's  preference. 

After  researching  the  cost  of  living  in  each  city,  and  the 
possible  employers,  and  after  numerous  "discussions"  with 
his  wife,  he  subjectively  arrives  at  the  following  fuzzy  sets 
relating  each  job  to  each  of  the  three  factors. 
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(13) 

(14) 

(15) 


F±  =  {A/0.4,  B/ 0.5,  C/0.6,  D/0.9} 

F2  =  {A/0.8,  B/0.9,  C/0.7,  D/0.5} 

F3  =  {A/0.01,  B/0.3,  C/0.6,  D/0.7} 

If  he  assumes  that  all  three  factors  are  equally  important, 
then  one  method  he  could  employ  to  arrive  at  a  decision  would 
be  to  choose  the  job  provided  by  the  maximum  of  the  intersec¬ 
tion  of  the  three  sets.  Evaluating  the  intersection,  which 
was  defined  previously,  gives 

D  =  Fl/'F2/'F3  =  {A/0.01,  B/0.3,  C/0.6,  D/0.5}  (16) 

therefore  the  job  located  in  Chicago  would  be  the  best  choice. 

There  are  basically  two  problems  with  this  approach. 
First,  in  all  liklihood,  the  importance  of  each  factor  rela¬ 
tive  to  the  other  factors,  is  different;  and  second,  this 
method  essentially  only  considers  the  worst  factor  for  each 
of  the  four  jobs.  Before  continuing  the  example,  a  discus¬ 
sion  of  how  these  deficiencies  can  be  resolved  will  be  given. 

The  first  inadequacy  can  be  eliminated  by  applying  an 
equation  to  each  fuzzy  set  being  considered.  For  example, 
each  grade  of  membership  in  a  set  can  be  divided  by  a  con¬ 
stant  and  raised  to  power  as  shown  below  for  an  arbitrary 
fuzzy  set  A. 

(A/e)f  =  {x1/[(u(x1)/e)  f  ]  ,  x2/[  (u(x2)/e)  f  ] ,  ..., 

x  /[(u(x  )/e)f]}  (17) 

n  n 
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where  e  and  f  will  be  referred  to  as  fuzzy  variables,  and 
the  expression  (A/e) **f  will  be  referred  to  as  a  fuzzy  equa¬ 
tion.  By  varying  the  values  of  the  fuzzy  variables  for  dif¬ 
ferent  fuzzy  sets  involved  in  a  decision,  the  designer  can 
vary  the  importance  of  each  factor  on  the  final  decision. 

The  second  problem  discussed  above  can  be  resolved  by 
employing  an  operation  which  considers  all  the  factors  in  the 
final  decision,  such  as  the  average  or  product  operations 
presented  earlier.  Therefore,  by  using  an  operation  such  as 
the  average  operation,  and  by  using  fuzzy  equations  the  im¬ 
portance  of  each  factor  on  the  final  decision  can  be  adjusted, 
and  all  the  factors  can  be  considered. 

It  should  be  noted  that  any  number  of  operations  other 
than  the  average  or  product  operations  could  be  used,  and  a 
variety  of  fuzzy  equations  could  be  used  that  would  also  over¬ 
come  the  deficiencies  discussed  above.  An  interesting  example 
of  possible  fuzzy  equations  that  could  be  implemented  (which 
is  similar  to  contrast  intensification  discussed  in  reference 
21)  would  be  any  equation  that  would  use  one  set  of  fuzzy 
variables  when  the  grade  of  membership  is  above  a  specified 
threshold,  and  would  use  another  set  of  variables  when  the 
grade  of  membership  is  below  the  threshold.  As  an  example 
of  how  such  an  equation  would  be  valuable,  consider  the  fac¬ 
tor  Pi  of  the  above  example.  If  the  salary  offered  for  one 
of  the  jobs  is  below  some  level,  then  obviously  this  factor 
should  be  weighted  heavily  in  the  final  decision;  however,  if 


all  the  salaries  offered  are  within  a  reasonable  range  then 
the  importance  of  this  factor  may  become  negligible. 

One  question  that  has  not  been  addressed  is  how  the 
fuzzy  variables  should  be  determined  to  produce  the  best  de¬ 
cision.  This  of  course  depends  on  the  decision  being  made, 
and  the  variables  may  be  chosen  arbitrarily  or  by  some  appro¬ 
priate  algorithm.  One  approach  would  be  to  derive  the  vari- 

» 

ables  based  on  the  performance  of  the  decision  scheme  for 
similar  situations  where  the  results  are  known.  In  the  job 
example,  the  businessman  could  survey  colleagues  who  were 
faced  with  similar  decisions  for  choosing  a  job,  and  ask  them 
how  satisfied  they  feel  they  would  have  been  had  they  taken 
the  other  jobs.  Using  this  information  a  set  of  variables 
could  be  derived  based  on  the  set  that  would  have  produced 
the  most  satisfaction  for  his  colleagues. 

Continuing  the  previous  example,  assume  the  businessman 
decides  to  use  the  average  operator  and  the  fuzzy  equation 
discussed  earlier.  After  generating  a  reasonable  set  of 
fuzzy  variables,  and  applying  the  fuzzy  equation  to  each  fac¬ 
tor,  the  fuzzy  set  for  each  factor  become 


(Fl/1) 0,5 

=  {A/0.63, 

B/0.71,  C/0.77,  D/0.95} 

(18) 

(F2/2) 2*° 

=  {A/0.16, 

B/0.20,  C/0.12,  D/0.06} 

(19) 

(F3/1) 0,4 

=  {A/0.06, 

B/0.62,  C/0.82,  D/0.87} 

(20) 

Then  applying  the  average  operator  gives 


(FI  +  F2  +  F3)/3  =  (A/0.28,  B/0.51,  C/0.51,  D/0.63}  (21) 


and  therefore  the  job  located  in  Denver  would  be  the  best 
choi ce . 

General  Remarks 

In  this  chapter  the  basic  concepts  of  fuzzy  set  theory 
were  presented,  and  a  simple  decision  scheme  was  proposed 
for  making  decisions  when  a  number  of  factors  must  be  con¬ 
sidered.  As  demonstrated  in  the  remainder  of  this  thesis, 
this  theory  can  provide  a  basis  for  formalizing  an  otherwise 
heuristic  approach  to  a  problem.  Thus,  the  formal  properties 
of  fuzzy  sets  suggested  in  the  literature,  should  only  be 
adhered  to  when  there  is  at  least  an  intuitive  motive  for 
doing  so.  Because  of  the  ability  of  this  theory  to  account 
for  the  imprecisions  that  are  involved  in  decision  making, 
this  theory  may  provide  an  ideal  theory  applicable  to  many 
Pattern  Recognition  and  Artificial  Intelligence  problems  that 
currently  have  no  solution. 


28 


IV.  Word  Recognition  Algorithm 


Introduction 

This  chapter  presents  the  decision  algorithm  used  to 
determine  the  word  spoken  given  the  acoustic  output  discussed 
in  Chapter  2.  The  algorithm  relies  solely  on  the  information 
provided  by  the  acoustic  processor,  and  on  fuzzy  statistic® 
derived  from  applying  the  recognition  algorithm  to  a  set  of 
training  speech  files,  to  produce  a  word  score  indicating  the 
plausibility  of  a  word  being  the  one  actually  spoken.  Be¬ 
cause  of  the  error-prone  and  variable  nature  of  the  acoustic 
output,  it  was  necessary  to  develop  an  algorithm  which  would 
produce  a  word  score  regardless  of  the  number  or  type  of 
errors  made.  Although  it  may  be  possible  to  limit  the  number 
of  words  considered,  the  decision  scheme  currently  computes 
a  score  for  every  word  in  the  vocabulary  to  determine  the 
word  spoken.  A  possible  approach  for  limiting  the  number  of 
words  considered  will  be  suggested  in  the  recommendations. 

The  word  recognition  algorithm  can  basically  be  divided 
into  the  application  of  three  sub-algorithms. 

1.  A  vector  scoring  algorithm  which  generates  a  score 
indicating  the  plausibility  of  a  word  phoneme  occurring  for 
a  given  vector  of  the  acoustic  output. 

2.  A  transition  algorithm  which  determines  the  word 


phoneme  that  was  most  likely  uttered  for  each  vector  of 


speech,  along  with  the  most  likely  error  that  occurred. 

3.  An  algorithm  which  determines  the  total  word  score 
based  on  the  results  of  the  previous  sub-algorithms,  and  on 
any  factors  not  previously  considered. 

Each  of  these  sub-algorithms  will  be  discussed  in  this  chap¬ 
ter. 

To  derive  a  meaningful  word  score,  the  decision  algo¬ 
rithm  must  take  into  account  a  number  of  factors.  This  is 
accomplished  using  the  concepts  of  fuzzy  set  theory  presented 
in  Chapter  3.  This  theory  provides  a  method  to  formally  pre¬ 
sent  the  algorithm,  and  provides  valuable  insight  required  to 
improve  or  expand  the  recognition  scheme. 

As  mentioned  earlier,  the  algorithm  uses  fuzzy  statistics 
based  on  past  recognition  attempts  to  compute  the  word  scores. 
These  statistics  are  generated  from  the  error  information.,  and 
results  obtained  from  the  recognition  algorithm.  A  thorough 
discussion  of  fuzzy  statistics  will  be  presented  in  the  next 
section. 


Fuzzy  Statistics 

The  name  "fuzzy  statistics"  is  derived  from  the  fact 
that  the  statistics  used  by  the  word  recognition  algorithm 
are  determined  by  applying  a  fuzzy  algorithm.  This  algorithm 
will  be  presented  at  the  end  of  this  chapter  since  it  requires 


information  obtained  from  applying  the  recognition  algorithm. 


The  statistics  are  collected  on  a  word  basis  from  a  set  of 


training  files  given  for  each  word.  The  logic  for  collecting 
statistics  for  each  word  rather  than  collecting  overall  sta¬ 
tistics  follows: 

1.  The  problem  of  choosing  one  universal  phoneme  repre¬ 
sentation  for  a  word  that  would  be  valid  for  a  variety  of 
speakers,  and  that  would  account  for  slight  variations  in  a 
word's  pronunciation  is  extremely  difficult;  but  can  be  some¬ 
what  overcome  by  collecting  separate  statistics  for  each  word. 
Also,  a  set  of  overall  statistics  may  become  distorted  from 
variations  peculiar  to  specific  words. 

2.  There  is  a  possibility  that  a  word's  phoneme  repre¬ 
sentation  is  erroneous,  and  word  based  statistics  can  be  used 
to  automatically  correct  errors  in  the  representation.  This 
is  in  fact  the  problem  that  arose  in  this  thesis.  The  output 
of  the  acoustic  processor  was  so  variable  that  the  word 
phoneme  representation  that  provided  the  highest  accuracy  was 
completely  different  from  the  expected  representation. 

3.  Because  of  memory  constraints,  it  would  be  impracti¬ 
cal  to  collect  overall  phoneme  statistics  based  on  the  sur¬ 
rounding  phonemes  (for  other  than  a  small  phoneme  set) .  By 
collecting  statistics  on  a  word  basis,  this  problem  is  over¬ 
come  since  the  position  of  each  word  phoneme  is  fixed.  The 
reason  that  it  is  desirable  to  have  statistics  for  each 
phoneme  based  on  the  surrounding  phonemes,  is  that  the  sur¬ 
rounding  phonemes  may  have  a  major  effect  on  the  types  of 


errors  that  occur  for  particular  phonemes.  Although  this 
problem  can  also  be  solved  by  using  appropriate  phonological 
rules,  these  rules  may  not  be  always  valid,  and  are  difficult 
to  implement. 

Figures  8  through  12  illustrate  the  data  that  are  main¬ 
tained  for  each  word  (actual  data  are  for  the  word  "zero") . 
Figure  8  provides 

-  the  number  of  training  speech  files  that  were  used  to 
determine  the  fuzzy  statistics  for  the  word; 

-  the  number  of  recognition  attempts  that  were  made  for 
the  word; 

-  the  number  of  times  the  word  was  correctly  recognized; 

-  the  average  recognition  score; 

-  the  average  difference  between  the  word's  recognition 
score  and  the  next  highest  score  (only  computed  when 
word  is  correctly  recognized) ;  and 

-  the  minimum  difference  between  the  word's  recognition 
score  and  the  next  highest  score  (also  only  computed 
when  word  is  correctly  recognized) . 

Except  for  the  number  of  training  files  used  to  generate  sta¬ 
tistics,  none  of  this  information  is  used  in  the  recognition 
algorithm;  however,  this  information  is  used  by  the  routine 
for  optimizing  the  fuzzy  variables,  and  also  serves  as  an 
indication  of  the  algorithm's  performance. 

Figure  9  illustrates  the  overall  and  word  fuzzy  variables 


THE  STATISTICS  FOR  ZERO  FOLLOW. 
THE  SPEAKER  IS  "CJM1* 


Fig 


#  OF  TIMES  SPOKEN  =  5 

#  OF  RECOGNITION  ATTEMPTS  =  6 

#  OF  TIMES  CORRECTLY  RECOGNIZED  =  6 

AVERAGE  RECOGNITION  SCORE  =  0.987 

AVERAGE  DIFFERENCE  BETWEEN  WORD  SCORE 
AND  NEXT  HIGHEST  SCORE  =  0.098 

MINIMUM  DIFFERENCE  BETWEEN  WORD  SCORE 
AND  NEXT  HIGHEST  SCORE  =  4.5E-02 


8.  Performance  Data  Maintained  for  Each 

Word  (Data  Shown  is  for  the  Word  "Zero") 


THE  OVERALL  FU22Y  VARIABLES  THAT  WERE  USED  FOLLOW 


STHR  = 

1. 0E+00 

SUBE  = 

1 . 0E+00 

SUBF  = 

5.0E-01 

INSE  = 

1 .3E+00 

INSF  = 

5. 0E-01 

DELE  = 

1 . 0E+00 

DELF  = 

8. 0E-01 

DELG  = 

1.0E-01 

DCNE  = 

1. 0E+00 

DCNF  = 

1.2E+00 

DCNG  = 

5.0E-01 

SFE  = 

2.0E+00 

SFF  = 

2. 0E+00 

CHVE  = 

4. 0E+00 

CHVF  = 

2.5E-01 

STATE = 

1 . 0E+00 

STATF= 

3. 0E+00 

STATG= 

0.0E+00 

THR1E= 

1 . 0E+Q0 

THR1F= 

7.5E-01 

THR2E= 

1.0E+00 

THR2F  = 

5.0E-01 

THE  WORD  FU22Y  VARIABLES  FOLLOW 


WSTHR  = 

S.0E-01 

WSUEE  = 

1. OE+OO 

WSUBF  = 

5.0E-01 

W1NSE  = 

1 .3E+00 

WINSF  = 

5.0E-01 

WDELE  = 

1 ♦ 0E+00 

WDELF  = 

8. 0E-01 

WDELG  = 

1.0E-01 

WDCNE  = 

1 «  OE+OO 

WDCNF  = 

1 .2E  +  00 

WDCNG  = 

5.0E-01 

WSFE  = 

2'.  0E+00 

WSFF  = 

2. OE+OO 

WCHVE  = 

4. OE+OO 

WCHVF  « 

2.5E-01 

WSTATE® 

1. OE+OO 

WSTATF= 

3, OE+OO 

WSTATG* 

7.0E-01 

WTHR1 E= 

1. OE+OO 

WTHR1F= 

7.5E-01 

WTHR2E= 

1 . OE+OO 

WTHR2F= 

5.0E-01 

Fig.  9. 

Example 

of  the  Overall  and 

Word  Fuzzy  Variables 


#  TIMES  EACH  WORD  PHONEME  IS  DELETED 


PHONEME  REP  =  21  70  7  12  62 

#  DELETED  =  40010 


Fig.  10.  Deletion  Statistics  Generated 
For  the  Word  "Zero" 


#  TIPIES  PHOHEHE  X  SUBSTITUTE!;  FOR  WORD  rHQNEKE 


WORD  F'HOnEH H3 


X 

21 

70 

7 

12 

62 

1 

i 

0  #000 

0.439 

0.000 

0.000 

1,934 

2 

0  #000 

0.000 

0.000 

0.000 

0.000 

3 

0.000 

0.000 

o.ooc 

0,000 

0.000 

4 

0.452 

1.921 

0.000 

0 » COG 

0,000 

5 

0.000 

0.514 

0.507 

0.000 

0.000 

6 

0.000 

1.366 

2.637 

0.000 

0,000 

7 

0.000 

1.521 

4.057 

2.012 

0.556 

& 

0.00O 

0.000 

3.277 

2.322 

0.000 

9 

0.000 

0.000 

2.628 

2.092 

0.000 

10 

0.000 

0.000 

2.095 

1,782 

0,000 

11 

0.000 

0.493 

1.601 

1.993 

0.475 

12 

0.459 

2.772 

1.065 

1.467 

1.016 

13 

0.000 

1.706 

0,994 

0.522 

0.000 

14 

0,000 

0,000 

0,000 

0.000 

0,000 

15 

0.000 

0.000 

0.000 

0.000 

0,000 

16 

0.000 

0,000 

0,000 

0.000 

0.000 

1/ 

0.000 

0.000 

0.000 

0.000 

0.000 

18 

0.469 

0.431! 

0.000 

0.000 

0.000 

19 

0.000 

0.000 

0.000 

0 . 0  C  •) 

:  ooo 

20 

0.000 

0,000 

0.000 

0.000 

0.000 

21 

0.841 

0.948 

0,000 

0.000 

0.000 

22 

0.000 

0,479 

0.000 

0.000 

0.000 

23 

0.437 

2.830 

2.165 

0.000 

0,000 

24 

0.000 

1.933 

0,000 

1.150 

1.396 

25 

0.000 

1.014 

0.000 

0,000 

0.000 

28 

0,000 

1.931 

1,639 

0.565 

0.  *99 

27 

0.000 

3.180 

3.272 

0.000 

0,000 

28 

0 . 0  0  0 

1.351 

0 . 000 

0.000 

0.000 

29 

0.000 

0.000 

0.000 

0.000 

0,000 

30 

0,000 

0,000 

0.0  00 

0.000 

0,000 

31 

0,000 

0,000 

0.000 

0,000 

0,000 

32 

0.612 

0.000 

0.000 

0.000 

0.000 

33 

0.000 

2.467 

3.370 

1.430 

0.934 

34 

0,000 

0.000 

4,106 

3.3*2 

2.054 

35 

0,000 

0.000 

3.771 

0.654 

0.000 

i 


»  TIMES  PHOKEME  X  SUBSTITUTED  FOP  WORD  t-'HONEPiE 


WORD  PH0KEPIE3 


X 

21 

/  0 

7 

12 

62 

36 

0.000 

0 . 000 

0.000 

0 . 000 

0 .  0o0 

37 

0.000 

0,000 

0.000 

0.000 

0.000 

38 

0.0O0 

0.000 

2.599 

1.422 

0.4S8 

39 

0.000 

0.000 

1.456 

0.00  0 

0 . 0  0  0 

40 

0,000 

0.000 

0.000 

0.000 

0.000 

41 

0.000 

0.000 

0.000 

0.000 

o.coo 

42 

0.000 

0.465 

0,000 

0 . 0  0  0 

0.000 

43 

0.592 

1.003 

0,000 

0.000 

0,000 

44 

0.000 

0.000 

0,000 

0.000 

0.000 

45 

0.000 

0.000 

0.000 

0.000 

0.000 

46 

0.000 

0.000 

0.000 

0.000 

0.000 

47 

0.000 

1.024 

0.000 

0.000 

0,000 

46 

0.000 

1.466 

0.000 

0.000 

0.000 

49 

0.000 

0.000 

0.000 

0.000 

0.000 

50 

0.000 

0.000 

0.000 

0 .  oot 

0.000 

51 

'  0.000 

0,000 

0,000 

0.000 

0,000 

52 

0.000 

0.000 

0,000 

0 . 000 

0.000 

53 

0.000 

0,000 

0.000 

0.000 

0.000 

54 

0.000 

1.024 

0.546 

1.706 

1.632 

55 

0.000 

0.000 

0.100 

0.000 

0,000 

56 

0.763 

1.030 

0.000 

0.000 

0.469 

57 

0.000 

0.000 

0.000 

0.000 

0.000 

53 

0.000 

0,000 

0.000 

0.000 

0,000 

59 

0.000 

0.000 

0.000 

0.000 

0.000 

60 

0.000 

0.000 

0,000 

0,000 

0.000 

61 

0,000 

0.4*3 

0,000 

0.000 

0.000 

62 

0.439 

2,005 

0,000 

0.615 

4,085 

63 

0 ,644 

2.635 

.>,000 

0,000 

0,000 

64 

0,000 

0.000 

0.000 

0 . 000 

0.000 

65 

0.000 

0.000 

0 ,  coo 

0,000 

0.000 

66 

0,000 

0.000 

0,000 

o .  000 

0 , 0  0  0 

67 

0,000 

0.000 

0 . 0  C  0 

0,000 

0.000 

68 

0.000 

0,000 

0.000 

0,000 

0.000 

69 

0,000 

0.463 

o.oco 

0.000 

0.000 

70 

0.864 

4.446 

1.528 

0.000 

0.000 

71 

0,000 

0.450 

0.000 

0.000 

2.163 

Fig.  11  (Cont'd) .  Substitution  Statistics  for  Word 

"Zero"  for  Prototype  Phonemes 
36  Through  71 


#  TIRES  PHOKERE  X  INSERTED  FOR  WORD  PHQNERE 


WORD  PHONERES 


X 

21 

70 

7 

12 

62 

1 

0.425 

0.000 

0.000 

0.000 

0.000 

2 

0.000 

0.000 

0.000 

0.000 

0.000 

ft 

0 

0.79* 

0.000 

0.000 

0.000 

0.000 

4 

0.455 

0.000 

0 » C  Oo 

0.000 

0.000 

* 

j 

O.OoC 

0 . 000 

0,000 

0.000 

0.000 

6 

0.000 

0.000 

0.342 

0.000 

0.000 

7 

0.000 

0,000 

0 . 000 

0,000 

0.000 

a 

0.000 

0.000 

2,153 

0,536 

0.000 

9 

0.000 

0.000 

1.976 

0,674 

0.000 

10 

0.000 

0.000 

0.566 

0.532 

0.000 

11 

0.000 

0.000 

0.569 

0.568 

0.000 

12 

0.45a 

0.000 

0.597 

0.000 

0.000 

13 

0.000 

0.000 

0.000 

0,000 

0.000 

14 

0.000 

0,000 

0.000 

0.000 

0.000 

15 

0.000 

0.000 

0,000 

0.000 

0.000 

16 

0.000 

0.000 

0.000 

0.030 

0.000 

17 

0.000 

0.000 

0.000 

0,000 

0.000 

18 

0.000 

0.000 

0,000 

0 .000 

0.000 

19 

0.000 

0.000 

0.000 

0.000 

0.000 

20 

0.488 

0,000 

0.000 

0.000 

0  ♦  'jQO 

21 

0.000 

0.000 

0,000 

0.000 

0.000 

22 

0.000 

0.000 

0.000 

0.000 

0 . 000 

23 

0.443 

0.000 

0,574 

0.000 

0.000 

24 

0.430 

0.000 

0,000 

0.000 

3.000 

25 

0.000 

0.750 

0.000 

0.000 

0.000 

26 

0.000 

0.000 

0.000 

0.000 

0.000 

27 

0.000 

0,000 

0.574 

0.000 

0.000 

28 

0.000 

0.000 

0,000 

0,000 

0.000 

29 

0 .  coo 

0,000 

0.000 

0.000 

o.ooc 

30 

0.000 

0.000 

0.000 

0.000 

0.000 

31 

0,474 

0.477 

0.000 

0,000 

o.ooc 

32 

0.479 

0,000 

0.000 

0,000 

0.000 

33 

0.000 

0,000 

1.696 

0,562 

0.000 

34 

0.000 

0.000 

1.484 

0.000 

0.000 

35 

0.000 

0,000 

1,367 

0,000 

0,000 

Fig.  12.  Insertion  Statistics  for  Word  "Zero" 
for  Prototype  Phonemes  1  Through  35 
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tt 

Tines  phohehe 

X  INSERTED  FOR 

WORD  PHONENE 

WORD 

PHONE 

HES 

X 

21 

70 

7 

12 

62 

36 

0.000 

0.000 

0.000 

0.000 

0,000 

37 

0 . 000 

0.000 

0,000 

0,000 

0,000 

ae 

0.000 

0.000 

1.956 

0.000 

0.000 

39 

0,000 

0 , O00 

0.577 

0.000 

0.00(7 

40 

0.000 

0.000 

0.000 

0.000 

0.000 

41 

0.000 

0.000 

0,000 

0.000 

0.000 

42 

0.000 

0.000 

0,000 

0.000 

0,000 

43 

0,504 

0.000 

0.000 

0.000 

0.000 

44 

0./99 

0.460 

0.000 

0.000 

0.000 

45 

0.000 

0.000 

0.000 

0.000 

0.000 

46 

1.165 

0.000 

0.000 

0.000 

0.000 

47 

0.000 

0.000 

0.000 

0.000 

0.000 

48 

0.000 

0 . 000 

0.000 

0.000 

0.000 

49 

0.000 

0.000 

0.000 

O.OOt 

0.000 

50 

0.000 

0.000 

0.000 

0.000 

0.000 

51 

0.000 

0.000 

0.000 

0.000 

0.000 

52 

0.000 

0.000 

0.000 

o.coo 

0.000 

53 

0.000 

0,000 

0.000 

0 , 000 

0.000 

54 

0.000 

0.000 

0.574 

0,000 

0.000 

55 

0.000 

0,000 

0.000 

0.000 

0,000 

56 

0.771 

0.477 

0,000 

0.000 

0.000 

57 

0.000 

0.000 

0.000 

0,000 

0.000 

58 

0,000 

0.000 

0.000 

0.000 

0.000 

59 

0.000 

0.000 

0.000 

0.000 

0.000 

60 

0.000 

0,000 

0,000 

0.000 

0.000 

61 

0,500 

0.486 

Q.OCO 

0.000 

0.000 

62 

0,504 

0.000 

0.000 

0.000 

0,000 

63 

0,530 

0.000 

0.000 

0  ■  OOO 

o.coo 

64 

0.000 

0,000 

0,000 

0,000 

0.000 

65 

0.000 

0,000 

0.000 

0 . 000 

0.000 

66 

0.000 

0,000 

0,000 

0,000 

0.000 

67 

0.000 

0,000 

C,  000 

Q.O0C 

0.000 

66 

0.000 

0,000 

0.000 

0.000 

o.coo 

69 

0,000 

0.000 

0,000 

O.GOU 

0  f  0  0 

70 

0.869 

0,000 

0.000 

0.000 

0.000 

71 

0.465 

0,000 

0.000 

0.000 

0.000 

Fig.  12  (Cont'd).  Insertion  Statistics  for  Word  "Zero" 

for  Prototype  Phonemes  36  Through  71 
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that  were  employed.  The  overall  fuzzy  variables  are  used 
only  to  generate  fuzzy  statistics,  and  remain  constant  for 
all  the  words.  The  word  fuzzy  variables  are  used  only  when 
attempting  to  recognize  the  word  spoken,  and  vary  for  each 
word  in  the  vocabulary.  Some  of  the  reasons  for  using  sepa¬ 
rate  sets  of  variables  to  generate  statistics  and  to  recog¬ 
nize  words  include: 

1.  It  is  likely  that  a  word's  fuzzy  variables  will  be 
changed  often,  and  if  the  statistics  were  not  computed  from 

a  separate  set  of  variables  then  they  would  have  to  be  recom¬ 
puted  each  time  the  word  variables  are  changed. 

2.  Using  separate  sets  of  fuzzy  variables  allow  the 
effect  of  different  factors  to  be  biased  when  the  statistics 
are  generated.  For  example,  the  effect  of  insertions  (defined 
below)  can  be  minimized  when  collecting  statistics  for  the 
purpose  of  minimizing  the  number  of  insertions  that  occur 
when  a  word  is  scored  by  the  recognition  algorithm.  Also, 

the  effect  of  statistics  can  be  eliminated  or  minimized  when 
generating  the  statistics. 

The  reason  for  using  a  separate  set  of  word  fuzzy  vari¬ 
ables  for  each  word  is  that  the  importance  of  different  factors 
may  vary  for  different  words.  For  example,  one  word  may  have 
a  larger  number  of  pronunciations  than  another,  and  therefore 
the  importance  of  insertions  should  be  different  for  each 
word.  Another  reason  for  using  word  dependent  variables,  is 
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that  it  is  easier  to  optimize  the  variables  on  a  word  basis. 

A  short  description  of  each  fuzzy  variable,  that  is  the  same 
for  both  word  and  overall  variables,  is  provided  in  Appendix 
A. 

There  are  three  types  of  error  statistics  that  are  gen¬ 
erated  for  each  word  based  on  its  phoneme  representations  sub¬ 
stitution  statistics,  insertion  statistics,  and  deletion  sta¬ 
tistics.  A  substitution  "error"  occurs  when  the  word  phoneme 
being  examined  is  among  the  phoneme  choices  provided  by  the 
acoustic  processor,  or  when  a  phoneme  with  a  sufficiently 
high  possibility  of  being  substituted  for  the  word  phoneme 
(as  indicated  by  the  substitution  statistics)  is  among  the 
choices.  Note  that  the  term  "error"  does  not  necessarily 
mean  that  an  error  actually  occurred.  If  neither  of  these 
conditions  is  met  for  a  given  vector,  then  an  insertion  error 
is  said  to  have  occurred.  A  deletion  error  occurs  when  a 
word  phoneme  is  not  represented  by  any  vectors  in  the  acoustic 
output,  or  when  a  substitution  has  not  been  made  for  the  word 
phoneme.  A  word  phoneme  may  be  considered  deleted  regardless 
of  the  number  of  insertions  errors  that  occurred  for  it.  A 
more  formal  definition  of  these  errors  will  be  given  in  the 
next  section. 

Figures  10,  11,  and  12  provide  the  format  for  the  dele¬ 
tion,  substitution,  and  insertion  statistics,  respectively 
(for  the  word  zero) .  The  deletion  statistics  simply  indicate 
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the  number  of  times  each  word  phoneme  was  deleted  when 
attempting  to  recognize  the  word's  training  files.  The  sub¬ 
stitution  and  insertion  statistics  indicate  the  plausibility 
of  any  phoneme  being  substituted  or  inserted,  respectively, 
for  a  given  word  phoneme.  As  previously  mentioned,  the  man¬ 
ner  in  which  these  statistics  are  computed  will  be  given  at 
the  end  of  this  chapter. 


Vector  Scoring  Algorithm 


This  section  presents  the  vector  scoring  algorithm  used 
to  compute  a  score  indicating  the  plausibility  of  a  given 
word  phoneme  occurring  for  a  vector  of  the  acoustic  output. 

The  word  phoneme  score  for  each  vector  i,  depends  upon  the 
following  factors: 

Fl^:  the  weight  assigned  to  each  phoneme  by  the  acoustic 
processor  for  each  vector  i; 

F2:  the  scale  factor  assigned  to  each  vector  by  the 

acoustic  processor; 

F3w(j)s  the  plausibility  of  a  phoneme  being  substituted 
for  a  given  word  phoneme,  w(j) ;  and 

F4w(j) :  the  plausibility  of  a  phoneme  being  inserted 

for  a  given  word  phoneme,  w(j). 

The  fuzzy  sets  for  each  of  these  factors  will  now  be 
discussed.  Factor  F]^  is  represented  by  the  fuzzy  set 
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where  n  is  the  number  of  phonemes,  Pc  is  the  cth  prototype 
phoneme,  and  Xc  ^  is  the  grade  of  membership  of  Pc  determined 
by  the  expression 

X_  .  =  (weight  assigned  to  phoneme  P^  by  the  acoustic 
processor) /100 ,  if  Pc  is  among  the  choices  pro¬ 
vided  for  vector  i; 

X  .  =0,  otherwise.  (23) 

C,  2.  v 

Factor  F2  is  represented  by  the  fuzzy  set 


F2  =  {V1/SF1,  V2/SF2j 


Vi/STi,  .  ..,  Vt/SFt} 


where  is  the  ith  vector  of  the  acoustic  output,  t  is  the 
total  number  of  vectors,  and  SF^  is  the  scale  factor  assigned 
to  vector  i  by  the  acoustic  processor. 


Factor  F3. 


w(j) 


is  given  by  the  fuzzy  set 


F3w(j)  ~  {P1/Sl,w(j)'  P2/S2,w(j) . Pc/Sc,w(j)'  *•* 

P/S  }  (2! 

n  n,  w  ( j ) 


where  S  is  the  grade  of  membership  of  P  in  the  set 

C  /  W  V  J  /  c 

given  by 


Sc,w(j>  =  PSo,w(j)/<nosE’oke  -  ndelw(j)' 


where  PSc  is  the  plausibility  of  phoneme  Pc  being  sub¬ 

stituted  for  word  phoneme  w(j)  (given  by  the  statistics 
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discussed  earlier) ,  nospoke  is  the  number  of  training  files. 


and  ndelw^j)  is  the  number  of  times  word  phoneme  w(j)  was 
deleted  in  the  training  files.  The  format  of  PSC  w(j)>  no- 
spoke,  and  ndelw(jj  was  discussed  in  the  previous  section. 


Factor  F4 


w(j) 


is  given  by  the  fuzzy  set 


F4w(j)  -  {Pi/1!  ufiW  Po/I 


l/J-l,w(j)'  r2/J-2,w(j) . Pc/Ic,w(j)' 


P  /I  ,  . .  } 
n  n,  w  ( j ) 


(27) 


where  this  set  is  identical  to  F3  ,  .»  except  that  I  ,f/,i  is 

w(j)  c  c,w(j) 


the  grade  of  membership  of  Pc  in  the  set  given  by 


I-  =  PI  /nospoke 

c,w(j)  c,w(j)' 


(28) 


where  PIc  w^jj  is  the  plausibility  of  phoneme  Pc  being  in¬ 
serted  for  word  phoneme  w(j) .  PI  ...  was  also  discussed 

C/  W  V  J  / 

in  the  previous  section. 

Using  the  fuzzy  sets  for  each  of  the  factors  discussed 
above,  the  vector  scoring  algorithm  can  be  separated  into  the 
calculation  of  two  intermediate  fuzzy  sets  which  are  then 
combined  to  form  the  fuzzy  set  used  to  determine  the  vector 
score.  The  first  intermediate  set,  fuzzy  set  is  obtained 

from  sets  Fli  and  F2,  and  is  based  solely  on  the  output  of 
the  acoustic  processor.  The  resulting  fuzzy  set  is  of  the 


form: 


where  the  grades  of  membership,  a„  s ,  are  given  by  the  expres- 

w  /  X 


sion 


a  =  [1  -  ((1  -  X  . ) /chve) chvf ]  x 
c,  1  c,  1 

[1  -  (SF,/sfe)sff  ]  ,  for  X  .  >  0 
1  c,  1 

a  .  =  0,  for  X  .  =  0  (30) 

C, 1  C, 1 

where  X  .  is  the  grade  of  membership  of  P  in  set  Fl . ,  SFi 
is  the  grade  of  membership  of  in  set  F2,  and  where  chve, 
chvf,  sfe,  and  sff  are  fuzzy  variables  used  to  alter  the 
effect  of  the  factors  Fl.^,  and  F2  on  the  final  vector  score. 
Refer  to  Appendix  A  for  the  upper  and  lower  limits  of  the 
fuzzy  variables. 

The  second  fuzzy  set  is  computed  depending  on  whether  a 
substitution  or  insertion  error  occurs  for  vector  i  given 
word  phoneme  w(j),  and  depends  only  on  the  fuzzy  statistics. 
By  definition,  a  substitution  error  occurs  if  the  grade  of 
membership,  X^j)  ^  for  phoneme  pw(j)  in  set  Fl.^  is  not  zero, 
or  if 

sc,w(j)  >  sthr  *  sw(j> ,w(j>  for  ai,y c- 

l<=c<=n,  in  set  F3w^j  (31) 


where  sthr  is  a  fuzzy  variable  used  to  determine  the  "substi¬ 
tution  threshold"  for  the  word  being  examined,  and  where 
Sw(j)  represents  the  plausibility  of  word  phoneme  w(j) 

being  substituted  for  itself  (sw(j),w(j)  is  not  necessarily 
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equal  to  one) .  If  a  substitution  error  does  not  occur  for 
vector  i,  then  by  definition  the  vector  is  considered  to  be 
an  insertion  vector  (a  vector  where  an  insertion  error 
occurs) . 

If  a  substitution  error  occurs,  then  the  second  inter¬ 
mediate  fuzzy  set,  of  the  form 


Bw(j)  "  {Pl/bl,w(j)'  P2/b2,w(j) . Pc/bc,w(j) . 

P  /b  ,  }  (3 

n  n,  w  ( j ) 


is  generated  by  assigning  the  grades  of  membership  of  the  set 
using  the  equation 

bc,w(j)  =  1  -  [<1  -  sc,w(J))/state1**  statf'  for  xc, i  >  0 

-  o,  for  Xci  =  0  (33) 

where  X_  ..  is  defined  previously  for  set  FI .  ,  and  where  state 
and  statf  are  fuzzy  variables  used  to  vary  the  importance  of 
statistics  on  the  vector  score.  The  fuzzy  set  which  indicates 
the  plausibility  of  each  prototype  phoneme  replacing  the  word 
phoneme  w(j),  for  vector  i,  is  then  given  by 
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(35) 
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where  "Ave"  indicates  that  the  average  of  the  members  of  the 
set  is  being  computed. 

Transition  Algorithm 

Figure  13  illustrates  the  transition  algorithm  used  to 
arrive  at  a  word  score  from  the  acoustic  output.  The  circles 
in  this  figure  represent  the  phonemes  in  a  word's  phoneme 
representation,  and  the  arrows  represent  transitions  from 
one  vector  of  the  acoustic  output  to  the  next.  A  transition 
can  occur  regardless  of  whether  a  substitution  or  insertion 
error  occurred  for  a  given  vector.  The  arrows  between  the 
circles  indicate  the  possible  transitions  that  can  occur 
between  word  phonemes.  The  vector  scoring  algorithm  dis¬ 
cussed  in  the  preceeding  section  is  used  to  determine  where 
the  transitions  between  word  phonemes  occur. 

For  each  vector,  the  vector  scores  for  the  current  and 
next  two  word  phonemes  are  computed.  If  the  vector  score  for 
the  current  word  phoneme  is  the  highest,  then  this  phoneme 
remains  the  current  one?  however,  if  either  of  the  next  two 
word  phonemes  have  a  higher  vector  score,  then  a  test  must 
be  performed  to  determine  if  the  vector  is  a  transition  be¬ 
tween  word  phonemes  or  an  error.  This  test  consists  of  aver¬ 
aging  the  vector  scores  of  each  of  the  three  word  phonemes 
for  a  specified  number  of  following  vectors  (nsum) ,  and 
applying  one  of  the  following  two  tests  depending  on  which 
word  phoneme  had  the  highest  score. 
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Transition  Diagram  Indicating  the  Possible  Transitions 
That  Can  Occur  Between  Word  Phonemes 


If  the  word  phoneme  directly  following  the  current 
word  phoneme  had  the  highest  vector  score  then  this  test  is 
applied. 

Test  1: 


thrle 


i+nsum 
((  Z 
k=i 


k,w(j) 


)  /  (nsum+1) ) 


thrlf 


i+nsum 

X^i  Vk,w(j+1) 


) / (nsum+1) 


(39) 


o 


then  w(j)  remains  the  current  word  phoneme; 
else  phoneme  w(j+l)  becomes  the  current  word  phoneme, 
where  i  is  the  number  of  the  current  vector,  w(j)  is  the  num¬ 
ber  of  the  current  word  phoneme,  nsum  is  the  number  of  vec¬ 
tors  to  be  averaged,  Vk  w^nj  is  the  vector  score  computed 
for  word  phoneme  w(n)  for  vector  k  using  the  vector  scoring 
algorithm,  and  thrle  and  thrlf  are  fuzzy  variables  used  to 
minimize  the  possibility  of  making  a  transition  between  word 
phonemes  when  an  error  actually  occurred. 

If  the  second  word  phoneme  following  the  current  one  had 
the  highest  vector  score,  then  the  following  test  is  used. 
Test  2: 

,  i+nsum  ,  ,, 

If  thrle  x  ((  i  V.  ,  ..)/ (nsum+1)  )tftrit  >= 

K  /  w  v  j  / 

i+nsum 

(k£i  vk.w(j+D)/(nsum+1) 
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i+nsum 

and  thr2ex{(  Z  V  ,  . .  )  /  (nsum+1) ) >= 
k=i  k,wij; 


i+nsum 

(k^  Vk,W<j+2))/(nsum+1) 


then  w(j)  remains  the  current  word  phoneme; 


else  if 


i+nsum  i+nsum 

Sjii  Vk,w<j-M>)/(nsum+1)  ”  thrlex<(k^  vk,w(j)>/ 


(nsum+1) 


thrlf 


i+nsum  thr2f 

and  thr2ex((  Z  V  ,  > )/ (nsum+1) )  >= 

K/Wvj+i; 


i+nsum 

^  Vk,w(j+2) )/(nsum+1) 


then  w(j+l)  becomes  the  current  word  phoneme; 
else  w(j+2)  becomes  the  current  word  phoneme, 
where  thr2e  and  thr2f  are  fuzzy  variables  used  to  minimize 
the  possibility  of  making  a  transition  from  the  current  word 
phoneme  to  the  second  word  phoneme  following  it  (deleting 
the  phoneme  inbetween) ,  when  an  error  actually  occurred. 
Whenever  a  transition  between  word  phonemes  is  made,  a  check 
is  performed  to  determine  if  a  phoneme  was  deleted.  If  a 
deletion  occurred,  then  a  count  indicating  the  number  of  word 
phonemes  that  have  been  deleted  is  incremented,  and  a  score 
based  on  the  deletion  statistics  is  updated  using  the  equa¬ 
tion 
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aelw(new)  =  delw(old)  +  [ (ndelw ^ ^ /nospoke) /dele] 


where  delw  indicates  the  word's  deletion  score  based  on  sta¬ 
tistics,  ndel^^j  is  the  statistic  indicating  the  number  of 
times  word  phoneme  w(j)  was  deleted  in  the  "nospoke"  training 
files  used  to  collect  statistics;  and  dele  and  delf  are  fuzzy 
variables  used  to  adjust  the  importance  of  the  deletion  sta¬ 
tistics  on  the  deletion  score. 

The  process  essentially  remains  the  same  when  there  are 
only  two  word  phonemes  remaining,  except  that  the  vector  score 
for  the  second  phoneme  following  the  current  one  is  simply 
considered  to  be  zero  (since  it  does  not  exist) .  When  there 
are  less  than  four  vectors  remaining  to  be  examined  in  the 
speech  file,  the  process  is  slightly  modified  to  discourage 
deletion  errors.  This  change  simply  consists  of  making  the 
word  phoneme  with  the  highest  vector  score  the  current  phoneme 
rather  than  performing  any  further  tests. 


Total  Word  Score 

The  word  score  obtained  for  each  word  is  based  on  the 
deletion  count  and  deletion  score  delw  given  in  the  last  sec¬ 
tion,  and  on  the  vector  score,  VS^,  obtained  for  each  vector. 
The  total  deletion  score  for  a  word  is  given  by  the  equation 


total  deletion  score  = 


1  -  [1  -  ((1  -  (deletion  count/ 
wordlength)  x  dcng) 

+  ((1  -  dcng)  x  Jdelw/deletion 
count) ) ]/dcne]dcn*  (43) 


where  wordlength  is  the  number  of  phonemes  in  the  word's 
representation,  dcng  is  a  fuzzy  variable  used  to  vary  the 
importance  of  the  number  of  deletions  in  a  word  versus  the 
deletion  statistics;  and  dene  and  denf  are  fuzzy  variables 
used  to  alter  the  effect  of  the  total  deletion  score  on  the 
word  score.  The  quantity 

(1  -  (deletion  count/wordlength) 
used  in  the  above  equation  is  used  to  overcome  the  problem 
that  arises  from  varying  word  lengths  for  different  words. 

For  example,  if  a  word  with  10  word  phonemes  in  its  represen¬ 
tation  has  2  deletions,  then  obviously  this  word's  deletion 
score  should  be  better  than  a  word  with  2  deletions  out  of 
only  3  word  phonemes. 

The  word  score,  WS,  for  the  word  being  examined  then 
becomes 


WS  =  (delg  x  (total  deletion  score)) 

+  (1  -  delg)  x  (  i  VS  ) /t)  (44) 

i=l  1 

where  t  is  the  total  number  of  vectors,  VSi  is  the  vector 
score  for  vector  i,  and  delg  is  a  fuzzy  variable  which  alters 
the  importance  of  the  deletion  score  versus  the  average  vec¬ 
tor  score. 

After  the  word  scoring  algorithm  presented  in  this  chap¬ 
ter  is  applied  to  each  word  in  the  vocabulary,  the  word  cho¬ 
sen  as  the  one  spoken  is  given  by  the  highest  word  score. 
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Computation  of  Fuzzy  Statistics 


This  section  presents  the  fuzzy  algorithm  used  to  gener¬ 
ate  the  fuzzy  statistics  discussed  at  the  beginning  of  this 
chapter.  Initially,  the  substitution,  insertion,  and  dele¬ 
tion  statistics  are  set  at  zero.  The  word  recognition  algo¬ 
rithm  is  then  applied  to  each  training  file,  and  the  results 
of  this  operation  are  used  to  generate  the  statistics.  When 
the  recognition  algorithm  is  applied  to  a  file,  the  following 
information  is  obtained  for  each  vector  i: 

-  the  word  phoneme  chosen, 

-  the  error  that  occurred  (sustitution  or  insertion) ,  and 

-  the  vector  score. 

For  example,  the  results  of  applying  the  recognition  algo¬ 
rithm  to  the  speech  file  shown  in  Figure  2  using  the  data 
shown  in  Figures  8  through  12,  is  shown  in  Figure  14.  This 
information  is  then  coordinated  with  the  original  training 
file  to  create  the  statistics. 

First  the  weights  assigned  to  each  phoneme  choice  for 
each  vector  are  combined  with  the  scale  factor  to  derive  the 
fuzzy  set  which  was  defined  in  the  vector  scoring  section. 
Then  by  defining  the  set 

susw  (j)  =  {sVj,  sv2,  ...,  svk>  (45) 

as  the  set  of  vectors  where  a  substitution  error  occurred  for 
word  phoneme  w(j),  and  the  set 
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Fig.  14.  Vector  Results  Obtained  by  Applying  Recognition  Algorithm  to  the 
Speech  File  Shown  in  Figure  2  Using  the  Data  Shown  in  Figures 
8  Through  12 


•  •  •  / 


(46) 


INS 


w(  j) 
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AV 


as  the  set  of  vectors  where  an  insertion  error  occurred  for 
word  phoneme  w(j),  the  fuzzy  statistics  can  be  computed. 

Note  that  the  set  of  vectors  where  phoneme  w(j)  was  chosen 
is  given  by  the  union  of  sets  SUBw^  and  INSw^j. 

If  SUBwy)  is  the  null  set,  then  phoneme  w(j)  is  con¬ 
sidered  deleted,  and  the  statistic  representing  the  number 
of  times  a  word  phoneme  is  deleted  is  incremented  as  follows 


ndel 


w(j) 


ndel 


w(j) 


+  1 


otherwise  ndel^^jj  remains  unchanged. 
Defining  the  fuzzy  sets 


(47) 


PSw(j)  =  <VSl,w(j)'  P2/S2,w(j) . Pc/Sc,w(j) . 


VSn,w(j)} 


(48) 


PIw(j)  =  {Pl/:El,w(j)'  P2/X2fw(j) . WwU) 

Pn^In,w(j)  * 


9  •  •  •  9 


(49) 


as  the  sets  representing  the  substitution  and  insertion  sta¬ 
tistics,  respectively,  for  phoneme  w(j)  using  the  training 
file  being  examined;  the  fuzzy  statistics  for  a  word  can  be 
updated  for  each  training  file.  These  two  sets  are  computed 
respectively,  using  the  equations 


FS 


w(j) 


=  A 


sv, 


U  A 


SV. 


U  A 


sv. 


U 


U  A 


SV, 


(50) 
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1*  V 


FI  ...  =  A .  „  U  A.„  U  A.  U 

w(j)  1VX  1V2  IV 


U  A. 


iv. 


(51) 


where  U  indicates  the  union  operation  discussed  in  Chapter  3. 
The  new  substitution  or  insertion  statistics  are  then  updated 
using  the  equations 


PSc,w<j)  <ne“>  -  PSc,w(J)  (°ld)  +  Sc.w(j)  <52> 

PIc,w(j)  <"*«>  =  PIc,w(J)  <°ld)  +  <53> 

where  PSC  w(j)  and  PIC  w(j)  were  defined  in  the  vector  scor¬ 
ing  section.  Once  these  statistics  are  computed  for  each 
word  phoneme  in  the  word  representation,  the  quantity  nospoke, 
indicating  the  number  of  training  files  used  to  generate  the 
statistics,  is  incremented. 

Any  number  of  training  files  can  be  used  to  generate  the 
statistics  for  a  word.  The  statistics  could  also  be  updated 
whenever  a  recognition  attempt  is  made,  provided  that  the 
actual  word  is  known. 

The  advantage  of  using  statistics  based  on  the  output  of 
the  recognition  algorithm,  rather  than  on  its  input,  is  that 
errors  or  peculiarities  of  the  algorithm  can  be  somewhat  over¬ 
come  using  this  approach. 
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V.  Determination  of  Word  Representations 


As  noted  in  Chapter  2,  the  output  of  the  acoustic  pro¬ 
cessor  is  extremely  variable.  Therefore  the  initial  problem 
was  not  one  of  determining  the  word  spoken  using  a  set  of 
known  phoneme  representations,  but  was  one  of  first  deter¬ 
mining  the  most  consistent  phoneme  representation  for  each 
word  in  the  vocabulary.  This  was  necessary  since  the  expected 
phoneme  strings  for  an  utterance  did  not  correspond  with  the 
actual  strings  that  were  output.  Although  the  fuzzy  statis¬ 
tics  and  variables  provide  a  basis  for  overcoming  this  pro¬ 
blem,  it  was  found  that  the  accuracy  of  the  recognition  algo¬ 
rithm  increased  substantially  when  a  more  consistent  phoneme 
representation  was  used. 

At  first  it  was  thought  that  a  more  consistent  represen¬ 
tation  for  a  word  could  easily  be  found  by  examining  a  number 
of  sample  output  files  for  the  word;  however,  it  soon  became 
apparent  that  this  was  not  a  simple  task.  One  reason  for 
this  is  that  the  output  is  so  variable  that  no  obvious  repre¬ 
sentation  exists.  Another  problem  is  that  the  word  recogni¬ 
tion  algorithm  attempts  to  overcome  the  errors  present  in  a 
word's  representation,  making  it  virtually  impossible  to 
determine  if  a  given  representation  is  the  best  one  possible. 
Because  of  these  problems,  an  algorithm  which  automatically 
determines  the  phoneme  representation  for  a  word  was  developed. 


It  was  found  that,  in  general,  this  algorithm  generated  bet¬ 
ter  word  representations  than  could  be  generated  manually. 
However,  this  algorithm  does  not  necessarily  derive  optimal 
representations,  and  further  research  in  this  area  is  recom¬ 
mended  . 

The  determine  word  representation  algorithm  first  creates 
an  initial  representation  for  the  word  using  the  word's  train¬ 
ing  files,  and  then  attempts  to  improve  it  using  fuzzy  sta¬ 
tistics  generated  for  this  representation.  Before  an  initial 
representation  is  created,  each  vector  of  each  training  file 
is  converted  into  the  fuzzy  sets  A^,  which  were  defined  in 
the  vector  scoring  section  of  the  last  chapter.  However, 
since  a  number  of  files  are  operated  on  in  parallel,  another 
index,  g,  was  added  to  indicate  the  training  file  being 
examined.  The  fuzzy  set,  A.  is  then  of  the  form 


Ai,g  "  {Pl/al,i,g'  P2/a2,i,g . Pc/ac,i,g . 

Pn^an,i,g* 


(54) 


with  a  .  computed  in  the  same  manner  as  a  .  was  in  the 
C/ 1 #  y  c#i 

last  chapter.  The  initial  representation  can  then  be  devel¬ 
oped  using  the  summation  of  these  fuzzy  sets  for  a  specified 
number  of  vectors. 

The  first  phoneme  of  the  initial  word  representation, 
w(l),  is  given  by  the  expression 

G  M 

w(l)  =  Max  {  £  l  A.  } 


(55) 


where  G  is  the  number  of  training  files  for  the  word  being 

examined,  M  is  the  number  of  vectors  to  be  summed  (typically 

chosen  as  the  number  of  vectors  corresponding  to  the  duration 

G  M 

of  one  phoneme),  and  E  E  A.  is  a  fuzzy  set  of  the  form 

g=l  i=l  1,g 


Ai,  g  =  {Vdl-  P2/d2' 


Vdc'  -  Pn/dn * 


with  grades  of  membership,  dc<  given  by  the  equation 


G  M 

d  =  [  E  E  a  ]  /  (M  *  G) 

c  g=l  i=l  c, i,gJ/  ' 


Since  the  lengths,  in  vectors,  of  a  word  phoneme  varies 
for  each  training  file,  it  is  necessary  to  find  the  vector 
where  a  transition  occurs  between  word  phonemes  for  each 
file.  To  accomplish  this,  a  threshold  is  used  to  indicate 
where  a  word  phoneme  ends  in  a  training  file.  The  last  vec¬ 
tor  of  the  word  phoneme  is  given  by  the  vector,  i,  satisfy¬ 
ing  the  expression 

i+M 

(  2  auH1  v  _)/M  <  threshold  (58) 

k=i+l  ,K'g 

with  M  being  a  constant  indicating  the  number  of  vectors  to 

be  averaged.  Once  this  vector  is  determined  for  the  file  g, 

a  score  is  obtained  for  each  prototype  phoneme  based  on  the 

summation  of  the  fuzzy  sets,  A.  ,  for  the  next  M  vectors, 

i  /  y 

as  shown  by  the  equation 


G 

g 


i+M 

z 

k=i+l 


(59) 


Fuzzy  set  Gg  provides  a  measure  of  the  most  likely  word 
phoneme,  w(j+l),  that  follows  the  current  word  phoneme,  w(j), 
in  training  file  g.  Repeating  this  operation  for  all  the 
training  files,  and  slimming  the  resulting  fuzzy  sets  Gg,  the 
next  word  phoneme  of  the  initial  representation,  w(j+l),  is 
given  by 

G 

w(j+l)  =  Max  {  Z  G  }  (60) 

g=l  g 

This  entire  process  is  repeated  until  every  vector  of  all  the 
word's  training  files  have  been  accounted  for.  When  the 
initial  representation  is  completed,  an  operation  is  performed 
to  insure  that  no  two  of  three  consecutive  word  phonemes  are 
duplicated.  This  is  required  since  the  transition  algorithm 
determines  where  transitions  occur  based  on  the  vector  scores 
of  the  current  and  next  two  word  phonemes  in  the  representa¬ 
tion.  If  any  two  of  three  consecutive  phonemes  are  the  same, 
then  the  results  may  become  distorted.  This  problem  is  al¬ 
leviated  by  simply  deleting  any  phoneme  in  the  representation 
with  a  duplicate  as  one  of  the  preceeding  two  word  phonemes. 

After  an  initial  representation  is  obtained,  an  attempt 
is  made  to  improve  it  by  using  the  representation  to  generate 
fuzzy  statistics.  These  statistics  are  generated  by  applying 
the  word  recognition  algorithm  on  the  word's  training  files 


60 


as  discussed  in  the  previous  chapter.  At  present,  there  are 
two  separate  operations  that  are  used  to  improve  the  repre¬ 
sentation  based  on  the  statistics. 

The  first  operation  uses  the  substitution  and  insertion 
statistics  to  improve  the  representation.  After  the  statis¬ 
tics  are  generated  using  the  initial  representation,  the  sub¬ 
stitution  and  insertion  statistics  are  combined  to  form  fuzzy 
sets  for  each  word  phoneme  of  the  form 


PS  ,  and  PI  have  been  defined  in  the  last  chapter.  These 
c  c 

sets  are  simply  the  summation  of  the  corresponding  substitu¬ 
tion  and  insertion  plausibilities  for  each  word  phoneme. 


Given  the  set  °w(j)  for  word  phoneme  w(j), 
sets  Ow(0)  and  0w(„+1)  as 


and  defining  the 


w  (0) 


w(N+l) 


=  {Pj/0,  p2/o. 


Pc/0, 


pn/°> 


where  N  is  the  number  of  word  phonemes  in  the  representation, 
a  phoneme  is  inserted  before  w(j)  in  the  representation  if 
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(w(j)  <>  Max  (Ow(j)»  and  <0^^ 
>=  °Max{0w(j)  },w(j+l) }  and 


*°Max{0w^ j  },w(j-l)  >_  °w (j)  ,  w  ( j — 1 ) 


otherwise  if 

w(j)  <>  Max  (65) 

the  phoneme  given  by  Max  is  inserted  after  word 

phoneme  w(j).  If  neither  of  the  above  conditions  is  met, 
then  no  phonemes  are  added.  This  process  is  repeated  for 
each  word  phoneme  w(j),  where  1  <=  j  <=  N.  When  this  oper¬ 
ation  is  completed,  the  new  representation  is  revised  to 
eliminate  any  duplicates  occurring  within  any  three  consecu¬ 
tive  word  phonemes  as  before,  and  new  statistics  are  computed 
using  the  revised  representation. 

The  second  operation  used  to  improve  the  representation 
given  above,  simply  eliminates  the  word  phonemes  that  satisfy 
the  condition 

ndel  >  (nospoke/2)  +  1  (66) 

w  (j ) 

where  ndelw^j  and  nospoke  have  been  defined  previously.  The 
purpose  of  this  operation  is  to  delete  the  phonemes  in  the 
representation  that  appear  to  be  invalid.  These  two  opera¬ 
tions  can  be  performed  iteratively  for  any  specified  number 
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of  iterations 


As  indicated  at  the  beginning  of  this  chapter,  the  de¬ 
termine  word  representation  algorithm  does  not  necessarily 
produce  optimum  word  representations.  This  algorithm  is  only 
a  suggested  approach,  and  other  algorithms,  or  variations  of 
this  one,  should  be  examined. 
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I.  Optimization  of  the  Fuzzy  Variables 


Because  of  the  nature  of  the  word  recognition  algorithm, 
it  is  virtually  impossible  to  determine  the  optimum  values 
for  the  fuzzy  variables.  As  explained  throughout  this  the¬ 
sis,  the  purpose  of  the  fuzzy  variables  is  to  alter  the  im¬ 
portance  of  each  of  the  factors  used  to  determine  the  word 
that  was  spoken.  One  method  that  is  typically  used  to  ac¬ 
count  for  the  variations  in  importance  of  the  different  fac¬ 
tors,  is  to  assign  them  fixed  values.  However  this  is  usu¬ 
ally  done  unknowingly  because  the  factors  become  embedded 
in  the  algorithm's  design.  For  example,  scaling  the  dis¬ 
tances  between  0  and  100,  and  providing  a  scale  factor  for 
each  vector  of  the  acoustic  output  (discussed  in  Chapter  2) , 
to  vary  the  importance  of  the  distances  obtained  for  one 
vector  versus  the  other  vectors,  usually  would  not  even  be 
considered.  If  this  were  not  done,  the  performance  of  the 
recognition  algorithm  would  be  much  lower.  Although  there 
is  no  way  of  deriving  an  optimum  set  of  fuzzy  variables  to 
maximize  the  algorithm's  performance,  a  reasonable  set  of 
variables  can  be  chosen  by  examining  the  performance  of  the 
algorithm  for  different  values  of  the  variables. 

The  algorithm  that  was  developed  to  find  a  reasonable 
set  of  word  fuzzy  variables  uses  the  performance  information 
shown  in  Figure  9  of  Chapter  4.  Basically,  each  word  fuzzy 


variable  is  altered  within  specified  limits,  and  performance 
data  is  derived  by  applying  the  recognition  algorithm  to  a 
set  of  sample  speech  files.  The  variable's  value  and  limits 
are  then  altered  based  on  this  data.  It  should  be  under¬ 
stood  that  this  method  is  only  one  possible  method  that 

could  have  been  employed,  and  is  presented  only  as  an  exam¬ 
ple  of  the  type  of  algorithm  that  may  be  useful  in  finding 

a  reasonable  set  of  variables  for  a  word. 

If  variable  names  are  assigned  to  the  performance  data 
of  Figure  9  as  follows: 

nospoke:  the  number  of  training  speech  files  that  were 
used  to  determine  the  fuzzy  statistics  for 
the  word; 

noattempti  the  number  of  recognition  attempts  that  were 
made  for  the  word; 

nocorrect:  the  number  of  times  the  word  was  correctly 

recognized; 

avescore:  the  average  recognition  score; 

avediff:  the  average  difference  between  the  word's 

recognition  score  and  the  next  highest  score 
(only  computed  when  word  is  correctly  recog¬ 
nized)  ;  and 

mindiff:  the  minimum  difference  between  the  word's 

recognition  score  and  the  next  highest  score 
(also  only  computed  when  word  is  correctly 
recognized) ; 


then  a  figure  of  merit  can  be  computed  which  will  indicate 
the  performance  of  the  recognition  algorithm  for  a  given  set 
of  fuzzy  variables.  The  equation  used  to  compute  the  figure 
of  merit,  FM,  is 

FM  =  FV1  x  (nocorrect/noattempt)  +  FV2  x  (avescore) 

+  FV3  x  avediff  +  FV4  x  mindiff  (67) 

where  FV1,  FV2,  FV3,  and  FV4  are  fuzzy  variables  subjectively 
chosen  to  establish  the  importance  of  each  of  the  above  fac¬ 
tors  on  the  figure  of  merit  (FV1  +  FV2  +  FV3  +  FV4  =  1) . 

This  figure  can  then  be  used  to  determine  whether  one  value 
of  a  fuzzy  variable  produces  better  results  than  another 
value. 

Initially,  values,  minimum  limits,  and  maximum  limits 
are  assigned  to  the  word  fuzzy  variables  that  are  to  be  eval¬ 
uated.  Then  the  figure  of  merit  is  computed  for  this  set  of 
variables  by  applying  the  word  recognition  algorithm  to  a  set 
of  sample  speech  files.  One  of  the  variables  is  then  altered 
using  the  equation 

fuzzyvar (new)  =  (fuzzyvar (initial)  +  minlimit)/2  (68) 

where  fuzzyvar  is  the  value  of  the  variable,  and  minlimit  is 
its  minimum  limit.  The  recognition  algorithm  is  again  applied 
to  the  same  set  of  speech  files,  and  a  new  figure  of  merit  is 
computed.  If  the  new  figure  of  merit  is  greater  than  the 


initial  figure  of  merit,  then  the  variable's  maximum  limit 
is  set  to  the  variable's  initial  value,  and  the  variable's 
current  value  remains  unchanged.  However,  if  the  new  figure 
of  merit  is  less  than  the  initial  one,  then  the  variable's 
minimum  limit  and  current  value  are  set  to  its  initial  value, 
and  another  figure  of  merit  is  computed  for  the  variable 
using  the  value  given  by 

fuzzyvar (new)  =  (fuzzyvar (initial)  +  maxlimit)/2  (69) 

where  maxlimit  is  the  variable's  maximum  limit.  If  the  fig¬ 
ure  of  merit  computed  using  this  value  is  greater  than  the 
initial  figure  of  merit,  then  the  variable's  value  remains 
unchanged,  and  its  minimum  limit  is  set  to  the  variable's 
initial  value:  otherwise,  the  variable's  maximum  limit  is 
set  to  the  variable's  current  value,  and  then  its  value 
is  returned  to  its  initial  value.  This  process  is  repeated 
for  each  word  fuzzy  variable  being  evaluated  for  a  given  set 
of  words,  and  continues  until  the  program  is  aborted. 

Since  the  user  has  the  option  of  choosing  which  fuzzy 
variables  to  examine,  and  which  words  are  to  be  used;  he  can 
specify  any  subset  of  fuzzy  variables  for  any  subset  of 
words  to  be  evaluated  in  an  attempt  to  maximize  the  system's 
performance.  It  should  be  noted  that  the  above  process  is 
applied  to  each  variable  for  the  complete  set  of  words,  and 
then  to  the  other  variables  being  evaluated,  before  the 


process  is  again  repeated  for  the  same  variable. 

Since  this  algorithm  attempts  to  find  the  local  maxi¬ 
mum,  in  terms  of  performance,  for  each  variable,  there  is 
no  guarantee  that  the  value  generated  for  the  variable  is 
optimum  because  of  the  non-linearity  of  the  algorithm. 
Another  problem  with  attempting  to  determine  the  optimum 
set  of  fuzzy  variables  for  a  word,  is  that  the  variables 
are  extremely  dependent  on  the  phoneme  representation  chosen 
for  the  word,  and  on  the  sample  speech  files  used  to  compute 
the  figures  of  merit.  Also,  as  might  have  been  guessed,  the 
algorithm  is  extremely  inefficient,  and  may  require  many 
hours  of  computer  time  on  a  conventional  computer  system  to 
arrive  at  a  reasonable  set  of  variables. 


VII.  Results 


The  software  developed  to  support  this  research  pro¬ 
vides  numerous  options.  It  has  been  designed  so  that  the 
only  input  required  to  initialize  the  system  is  a  set  of 
sample  training  files  for  each  word  in  the  vocabulary.  Since 

the  system  is  completely  independent  of:  the  vocabulary,  the 

* 

acoustic  process  (provided  that  phonemes  are  extracted) ,  and 
the  restrictions  placed  on  the  system's  input;  numerous  ex¬ 
periments  can  be  performed  to  assist  in  the  development  of 
an  isolated  word  recognition  system.  Some  experiments  that 
can  be  easily  performed  include: 

-  determining  the  effect  of  increasing,  or  altering  the 

tr  vocabulary ; 

-  analyzing  the  system's  performance  for  various  groups 
of  speakers; 

-  examining  how  the  system's  accuracy  degrades  for  in¬ 
creasing  levels  of  background  noise;  and 

-  evaluating  variations  of  the  algorithms  employed  in 
the  acoustic  process,  or  evaluating  the  performance 
of  a  variety  of  different  acoustic  processors  which 
output  phoneme  strings. 

Figure  15  illustrates  the  general  steps  required  to  initialize 
the  recognition  algorithm  for  a  given  experiment.  Once  the 
system  has  been  initialized,  results  can  be  obtained  by  fol- 
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1.  Digitize  a  number  of  speech  files  for  each  vocabu¬ 
lary  word  to  be  used  for  training  (using  program 
AUDIOHIST) . 

2.  Execute  programs  PHDIST  and  CHOICES  for  each  speech 
file  to  obtain  the  output  shown  in  Chapter  2  (these 
programs  are  given  in  the  appendix) . 

3.  Create  the  following  data  files  for  the  experiment 
being  performed: 

- SPECH, - RECOG, - WFILE, - SPILE, - FZOPT 

where  " - "  represents  any  three  characters,  and 

are  used  to  indicate  the  experiment  being  performed. 

4.  Store  all  the  training  files  in  data  files  - SPECH, 

and  - RECOG  inputting  an  arbitrary  phoneme  repre¬ 

sentation  for  each  word  (this  step  is  accomplished 
using  an  option  of  program  LEARN) . 

5.  Execute  the  "Determine  Word  Representation"  pro¬ 
cedure  for  all  words  (option  of  program  LEARN) ,  to 
obtain  an  initial  representation  for  each  word. 
Experimentation  using  this  representation  as  a 
basis  may  lead  to  a  better  representation,  and 
should  be  performed.  Note  that  this  step  may  be 
bypassed  if  valid  representations  already  exist. 
Whenever  word  representations  are  altered  or  input 
manually  procedure  "Initstat"  of  program  LEARN  must 
be  executed  to  initialize  the  word  statistics. 

6.  Initialize  the  overall  and  word  fuzzy  variables, 
and  execute  the  "Fuzzy  Variable  Optimization" 
procedure,  given  in  program  LEARN,  to  optimize 
the  word  variables.  These  variables  can  also  be 
optimized  manually. 

7.  Delete  and  re-create  data  file  - RECOG  to  ini¬ 

tialize  it  for  recognition  purposes. 


Fig.  15.  Steps  Required  to  Initialize  Recognition 
Algorithm  for  an  Experiment 
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lowing  the  steps  outlined  in  Figure  16.  These  steps  provide 
only  one  example  of  the  many  options  that  are  provided  by 
the  software.  Refer  to  the  documentation  given  for  program 
LEARN  in  the  appendices  for  a  discussion  of  all  the  available 
options . 

Although  simi lar  experiments  can  be  performed  with  other 

word  recognition  algorithms,  they  are  very  difficult  to  ac- 

* 

complish.  Other  systems  are  very  dependent  on  the  exact 
acoustic  process  employed,  and  usually  require  detailed 
knowledge  of  the  features  being  extracted.  These  systems 
also  require  a  high  level  of  accuracy  for  identifying  the 
features,  and  their  performance  degrades  rapidly  as  the  num¬ 
ber  of  feature  extraction  errors  increases.  The  word  recog¬ 
nition  algorithm  developed  in  this  thesis  does  not  depend 
on  the  details  of  the  acoustic  processor,  or  features  ex¬ 
tracted;  and  degrades  gracefully  as  the  number  of  errors 
made  by  the  acoustic  processor  increases.  This  is  accom¬ 
plished  from  the  fact  that  the  only  requirement  imposed  by 
this  algorithm  is  that  the  features  output  for  a  given  word 
exhibit  some  degree  of  consistency.  The  success  of  this 
algorithm  relies  solely  on  its  ability  to  determine  what 
sequences  of  features  are  consistent  for  a  word,  given  a 
number  of  sample  acoustic  outputs  for  the  word.  Although 
the  word  recognition  algorithm  was  designed  to  determine  this 
consistency  without  any  human  intervention,  problems 


Repeat  steps  1  and  2  shown  in  Figure  15  to  obtain 
a  set  of  speech  fi3es  to  be  used  for  recognition 
purposes . 

Store  all  the  files  obtained  from  the  previous 

step  in  data  file  -  RECOG  (using  an  option  of 

program  LEARN) . 

Execute  procedure  "Recogall"  of  program  LEARN  to 
obtain  recognition  results  for  the  files  stored  in 

-  RECOG.  The  results  of  this  operation  can  be 

obtained  by  executing  program  OUTSTAT.  This  pro¬ 
gram  provides  the  information  shown  in  Figures  8 
through  12  for  each  word.  Detailed  recognition 

information  for  each  file  stored  in  -  RECOG 

will  also  be  given  in  file  "RESULTS"  after  the 
completion  of  procedure  "Recogall". 


Fig.  16.  Steps  required  to  obtain  recognition  results 
after  the  steps  in  Figure  15  have  been  exe¬ 
cuted. 


associated  with  the  Determine  Word  Representation  and  Opti¬ 
mize  Fuzzy  Variable  algorithms  make  human  intervention  neces¬ 
sary  to  obtain  the  best  results. 

It  was  found  that  the  Determine  Word  Representation 
algorithm  does  not  always  generate  a  reasonable  word  repre¬ 
sentation  using  an  acoustic  process  that  does  not  produce  a 
consistent  sequence  of  phonemes  for  a  word.  Experimentation 
indicated  that  better  word  representations  were  generated 
for  words  with  more  consistent  outputs,  as  expected.  Re¬ 
examination  of  the  acoustic  outputs  for  the  word  "zero", 
given  in  Figures  2  through  7  of  Chapter  2,  will  illustrate 
the  difficulty  in  obtaining  a  valid  word  representation. 

Since  it  was  found  that  in  some  instances  the  Determine  Word 
Representation  algorithm  produces  better  representations 
than  those  generated  manually,  the  approach  that  was  used  to 
generate  a  word  representation  consisted  of  using  the  algo¬ 
rithm  to  derive  an  initial  representation,  and  manually  ex¬ 
perimenting  with  this  representation  to  determine  if  a  better 
one  could  be  found. 

As  indicated  in  Chapter  6,  the  Optimize  Fuzzy  Variable 
algorithm  requires  many  hours  of  computer  time  on  a  conven¬ 
tional  computer  to  generate  a  reasonable  set  of  variables 
for  a  word.  Because  of  this,  this  algorithm  was  only  employed 
for  a  small  set  of  words  to  generate  a  preliminary  set  of 
variables.  These  variables  were  then  "optimized"  by  trial 
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and  error  to  derive  a  general  set  of  fuzzy  variables  that 
were  used  for  all  the  words  in  the  vocabulary.  Although 
these  variables  could  have  been  further  "optimized"  on  a 
word  basis;  experimentation  indicated  that  such  optimization 
only  provided  a  minor  increase  in  the  system's  overall  recog¬ 
nition  accuracy,  and  therefore  should  only  be  accomplished 
after  a  fairly  high  recognition  accuracy  is  obtained  using 
the  same  variables  for  each  word.  In  other  words,  reasonably 
accurate  results  can  be  obtained  using  the  same  variables  for 
each  word. 

The  word  recognition  algorithm  exhibited  a  unique  sta¬ 
bility  which  lessened  the  difficulty  in  handling  the  above 
problems.  It  was  found  that  numerous  word  representations 
produce  similar  results.  In  fact,  almost  any  representation 
that  is  chosen  by  examining  sample  speech  files  of  a  word 
produces  results  that  substantially  exceed  chance.  It  was 
also  found  that  minor  variations  of  most  of  the  fuzzy  vari¬ 
ables  only  had  a  minor  impact  on  the  system's  performance. 
These  characteristics  are  exhibited  because  of  the  manner  in 
which  the  word  fuzzy  statistics  are  generated,  and  because 
of  the  statistics  impact  on  the  word  scores. 

To  indicate  the  significance  of  the  word  recognition 
algorithm  developed  in  this  thesis,  two  experiments  were 
performed.  For  each  of  these  experiments  the  acoustic  pro¬ 
cessor  developed  by  Karl  Seelandt  was  employed  using  the  71 


prototype  phonemes  shown  in  Table  I.  As  indicated  in  Chap¬ 
ter  2,  these  prototypes  were  derived  from  Seelandt's  speech 
for  the  words  "zero"  through  "nine".  The  vocabulary  that 
was  chosen  consisted  of  15  words.  These  words,  and  their 
corresponding  phoneme  representations,  generated  using  sam¬ 
ple  speech  files  spoken  by  GJM,  are  shown  in  Figure  17. 

Note  that  many  of  the  sounds  occurring  in  the  last  5  words 
do  not  correspond  to  any  of  the  prototypes  in  the  phoneme 
set,  and  that  the  representations  chosen  for  the  words  "zero" 
through  "nine"  do  not  correspond  to  the  expected  representa¬ 
tions  shown  in  Table  I.  The  overall  fuzzy  variables  that 
were  used  to  collect  the  word  fuzzy  statistics,  and  the  word 
fuzzy  variables  that  were  used  for  all  the  words  in  the  vo¬ 
cabulary  were  shown  in  Figure  9. 

Experiment  1.  This  experiment  provides  an  indication 
of  the  algorithm's  performance  when  recognizing  the  speech 
of  a  dependent  speaker.  Statistics  were  collected  for  each 
word  using  5  training  files  spoken  by  GJM.  The  recognition 
algorithm  was  then  applied  to  90  speech  files  (6  for  each 
word)  that  were  also  generated  by  GJM.  Of  the  90  speech 
files,  82.2%  were  identified  correctly,  with  91.1%  of  the 
correct  words  being  among  the  top  2  choices  (the  2  highest 
scoring  words) . 

Experiment  2 .  This  experiment  illustrates  the  perfor¬ 
mance  of  the  recognition  algorithm  for  recognizing  the  speech 


WORD 


PHONEME  REPRESENTATION 


ZERO 

21 

- 

70 

-  7  - 

12  - 

62 

ONE 

33 

- 

39 

-  70 

TWO 

56 

- 

70 

-  62 

THREE 

23 

- 

27 

-  70 

-  62 

FOUR 

33 

- 

34 

-  54 

FIVE 

39 

- 

7  - 

54 

SIX 

56 

- 

47 

-  1  - 

29 

SEVEN 

12 

- 

7  - 

63  - 

62 

EIGHT 

48 

- 

57 

-  1  - 

56  - 

62 

NINE 

63 

- 

17 

-  42 

-  62 

CCIP 

70 

- 

56 

-  57 

-  52  - 

1 

ENTER 

21 

- 

69 

-  70 

-  63 

FREQUENCY 

27 

- 

70 

-  1  - 

54  - 

23 

STEP 

12 

- 

9  - 

52  - 

1 

THREAT 

7  - 

52 

Fig.  17.  Vocabulary  and  Corresponding  Word 
Representations 


of  several  speakers  without  first  training  the  system  for 
these  speakers.  The  system  attempted  to  recognize  the  speech 
of  seven  male  speakers,  with  different  accents,  training  the 
system  with  10  GJM  speech  files  for  each  word.  The  results 
of  this  experiment  for  each  speaker  are  shown  in  Figure  18. 

As  shown,  a  total  of  53.6%  of  the  speech  files  were  correctly 
identified,  with  76.0%  of  the  correct  words  being  among  the 
top  2  choices. 

The  above  results  indicate  that  the  word  recognition 
algorithm  developed  in  this  thesis  is  capable  of  achieving 
reasonable  levels  of  accuracy  given  a  very  inaccurate  input. 
At  present  the  author  is  unaware  of  any  other  algorithms 
capable  of  achieving  similar  results  given  similar  input. 

The  results  of  Experiment  2  indicate  that  there  is  some  con¬ 
sistency  in  the  output  of  the  acoustic  analyzer  for  speakers 
with  different  characteristics.  The  results  obtained  can 
easily  be  improved  in  a  number  of  ways.  For  example,  the 
use  of  more  training  files  should  produce  more  accurate  re¬ 
sults,  and  the  use  of  a  training  set  consisting  of  the  speech 
of  a  number  of  speakers  should  significantly  improve  the 
accuracy  obtained  in  Experiment  2.  Further  experimentation 
will  most  likely  produce  word  representations,  and  fuzzy 
variables  that  may  significantly  increase  the  accuracy  ob¬ 
tained  for  the  above  experiments.  It  can  easily  be  verified 
that  100%  accuracy  can  be  obtained  by  this  algorithm  given 
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SPEAKER 

NO.  SPEECH 
FILES 

PERCENTAGE 

CORRECTLY 

IDENTIFIED 

PERCENTAGE 

AMONG  TOP 

2  CHOICES 

BDB 

30 

50.2 

70.0 

DCD 

30 

56.7 

70.7 

DEP 

10 

50.0 

90.0 

JHC 

15 

46.7 

60.7 

MJK 

10 

60.0 

90,0 

RLC 

15 

60.7 

86.7 

RWH 

15 

46.7 

61.3 

TOTAL 

125 

53.6 

76.0 

Fig.  18.  Results  of  Experiment  2 
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a  perfect  input  from  the  acoustic  processor.  Therefore  as 
the  accuracy  of  the  acoustic  process  improves,  the  accuracy 
of  the  word  recognition  algorithm  will  also  improve. 


VIII.  Conclusions 

The  word  recognition  algorithm  developed  in  this  research 
obtains  reasonable  performance  goals  despite  the  inaccuracy 
of  the  acoustic  processor,  and  the  intrinsic  variability  of 
the  input  speech.  This  fact  implies  that  it  may  not  be  neces¬ 
sary  to  develop  an  extremely  accurate  feature  extraction  pro¬ 
cess  in  the  acoustic  analyzer  to  achieve  automatic  speech 
recognition.  Although  this  algorithm  obviously  does  not 
represent  the  final  solution  to  the  speech  recognition  pro¬ 
blem,  it  definitely  suggests  a  useful  approach  for  developing 
the  required  decision  process  to  solve  this  problem,  and  per¬ 
haps  many  other  pattern  recognition  problems.  The  relative 
success  of  the  recognition  algorithm  indicates  that  fuzzy  set 
theory  may  provide  the  formalism  necessary  to  overcome  the 
problems  that  arise  when  making  complex  decisions  based  on  a 
number  of  variable  and  error  prone  factors. 

Fuzzy  set  theory  provides  a  means  to  deal  with  the  im- 
precisions  involved  in  making  decisions.  It  allows  decisions 
to  be  based  on  a  more  intuitive  level  rather  than  on  formal 
mathematical  reasoning.  The  advantage  of  this  can  easily  be 
seen  by  examining  some  of  the  many  heuristics  used  throughout 
artificial  intelligence  and  pattern  recognition  which  pro¬ 
vide  the  only  practical  solution  to  many  problems.  The  fuzzy 
statistics  used  in  the  algorithm  enable  the  algorithm  to 


automatically  adapt  to  the  input  based  on  its  actual  perfor¬ 
mance.  Since  these  statistics  are  collected  on  a  word  basis, 
the  need  for  phonological  rules  is  eliminated.  The  fuzzy 
equations  developed  in  this  thesis  allow  a  number  of  factors 
to  be  considered  in  an  efficient  manner,  and  enables  the 
importance  of  each  factor  to  be  altered.  This  aspect  of  the 
algorithm  is  important  because  it  essentially  provides  a 
method  for  the  program  to  dynamically  restructure  the  deci¬ 
sion  process. 

The  transition  algorithm  discussed  in  Chapter  4  provides 
a  method  to  determine  the  plausibility  of  a  word  matching  a 
speech  file  regardless  of  the  number  of  errors  that  are  made. 
The  advantage  of  using  an  algorithm  such  as  this  one  is  that 
it  does  not  fail  when  there  are  unexpected  errors. 

The  algorithm  can  be  used  for  speaker  dependent  or  in¬ 
dependent  speech,  and  can  be  implemented  using  any  acoustic 
processor  which  outputs  the  features  in  the  format  discussed 
in  Chapter  2,  regardless  of  what  features  are  extracted. 

The  flexibility,  adaptability,  and  restructurability 
exhibited  by  this  algorithm  appear  to  be  similar  to  the 
properties  that  enable  humans  to  perform  speech  recognition, 
and  suggest  that  the  concepts  employed  in  this  thesis  may 
also  be  useful  in  many  other  areas  of  research. 


IX.  Recommendations 

Recommendations  for  possible  improvements  of  the  pro¬ 
posed  word  recognition  algorithm  follow. 

1)  The  algorithm  currently  computes  a  score  for  each 
word  in  the  vocabulary  when  evaluating  a  speech  file.  For 
large  vocabularies  this  approach  may  require  an  excessive 
amount  of  computer  time  to  determine  the  word  spoken.  One 
possible  solution  to  this  problem  that  could  be  easily  imple¬ 
mented  is  to  separate  the  vocabulary  into  classes  of  words. 

The  algorithm  could  then  limit  the  vocabulary  search  by  first 
determining  the  class  of  words  that  is  most  likely  to  have 
been  spoken,  and  then  compute  the  scores  only  for  the  words 

in  this  class.  Another  solution  for  decreasing  the  algorithm's 
processing  time  is  to  use  thresholds  which  would  eliminate 
words  from  further  consideration  as  soon  as  a  word's  score 
falls  below  a  predefined  threshold. 

2)  Alternate  fuzzy  equations  should  be  examined  which 
would  provide  greater  flexibility  and  adaptability.  For 
example,  the  threshold  fuzzy  equations  discussed  in  Chapter 
3  could  be  used.  More  general  fuzzy  equations  should  also 
be  examined  that  would  allow  the  algorithm  to  determine  and 
vary  the  relationships  of  factors  with  one  another.  Equa¬ 
tions  such  as  these  may  eventually  lead  to  completely  restruc- 
turable  decision  processes  that  would  be  able  to  adapt  to  any 
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problem  where  the  relevant  factors  are  known. 

3)  The  transition  algorithm  discussed  in  Chapter  4 
declares  exact  vector  locations  where  transitions  occur  be¬ 
tween  word  phonemes;  however,  examination  of  Seelandt's 
acoustic  output  indicates  that  the  transitions  between  word 
phonemes  may  be  smooth  (fuzzy) .  The  transition  algorithm 
should  be  modified  to  wake  fuzzy  transitions  rather  than 
discrete  transitions,  and  the  performance  of’ the  modified 
algorithm  should  be  evaluated  to  determine  if  the  accuracy 
of  the  algorithm  is  increased. 

4)  The  determine  word  representation  algorithm  presen¬ 
ted  in  Chapter  5  does  not  always  derive  reasonable  phoneme 
representations.  Other  algorithms,  or  variations  of  this 
one,  should  be  developed  to  overcome  this  problem.  An  inter¬ 
esting  solution  to  this  problem  would  be  to  eliminate  the 
need  for  deriving  word  representations  by  relying  solely  on 
the  word  fuzzy  statistics.  This  might  be  accomplished  by 
simply  using  a  single  arbitrary  representation  for  each  word 
to  generate  the  statistics,  and  by  malting  minor  modifications 
to  the  algorithm. 

5)  The  fuzzy  statistics  for  each  word  are  generated 
using  a  set  of  training  speech  files  and  a  set  of  overall 
fuzzy  variables.  The  effect  of  using  a  large  number  of  train¬ 
ing  files  for  each  word,  or  collecting  statistics  for  each 
recognition  attempt  should  be  examined.  Also,  the  use  of 
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individual  word  fuzzy  variables  instead  of  overall  variables 
should  be  considered  for  generating  the  statistics. 

6)  Because  of  the  inefficiency  of  the  fuzzy  variable 
optimization  algorithm  described  in  Chapter  6,  other  tech¬ 
niques  for  optimizing  the  fu?zy  variables  should  be  devel¬ 
oped. 

7)  It  is  believed  that  the  accuracy  of  the  recognition 
algorithm  will  rise  sharply  given  a  more  accurate  acoustic 
processor.  Certainly,  with  errorless  output  from  the  pro¬ 
cessor  and  absolutely  consistent  speech,  it  is  easy  to  show 
that  the  recognition  algorithm  would  achieve  100%  accuracy. 
What  is  not  known  at  this  time  is  the  rate  at  which  it 
degrades  as  the  acoustic  analyzer  makes  more  mistakes,  and 
as  the  speech  input  becomes  more  variable.  Therefore,  con¬ 
tinuation  of  research  for  improving  Seelandt's  algorithm  is 
highly  recommended. 
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individual  word  fuzzy  variables  instead  of  overall  variables 
should  be  considered  for  generating  the  statistics. 

6)  Because  of  the  inefficiency  of  the  fuzzy  variable 
optimization  algorithm  described  in  Chapter  6,  other  tech¬ 
niques  for  optimizing  the  fuzzy  variables  should  be  devel¬ 
oped  . 

7)  It  is  believed  that  the  accuracy  of  the  recognition 
algorithm  will  rise  sharply  given  a  more  accurate  acoustic 
processor.  Certainly,  with  errorless  output  from  the  pro¬ 
cessor  and  absolutely  consistent  speech,  it  is  easy  to  show 
that  the  recognition  algorithm  would  achieve  100*  accuracy. 
What  is  not  known  at  this  time  is  the  rate  at  which  it 
degrades  as  the  acoustic  analyzer  makes  more  mistakes,  and 

<r»  as  the  speech  input  becomes  more  variable.  Therefore,  con¬ 

tinuation  of  research  for  improving  Seelandt's  algorithm  is 
highly  recommended. 
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A  Speech  Understanding  System 


A  speech  understanding  system  (SUS)  is  a  system  capable 
of  recognizing  speech,  and  interpreting  it  to  perform  some 
task.  A  block  diagram  of  a  SUS  is  given  in  Figure  19.  As 
shown,  speech  enters  the  system  through  a  microphone,  and  is 
input  into  an  acoustic  analyzer.  The  acoustic  analyzer  ex¬ 
tracts  features,  such  as  phonemes,  from  the  speech,  and  out¬ 
puts  the  features  to  the  word  sequence  hypothesizer .  The 
word  sequence  hypothesizer  determines  the  most  probable  word 
sequence  that  occurred  from  the  sequence  of  features.  The 
word  sequences  are  then  analyzed  by  the  sentence  analyzer 
which  determines  the  task  to  be  performed.  The  sentence 
analyzer  also  provides  information  to  the  word  sequence  hy¬ 
pothesizer  to  eliminate  the  generation  of  invalid  word  se¬ 
quences.  To  perform  these  tasks,  the  sentence  analyzer  uses 
the  following  information. 

1.  Phonetics.  Phonetics  is  used  to  account  for  the 

different  pronunciations  of  words  based  on  the  con¬ 
text  of  the  surrounding  words,  and  on  typical  varia¬ 
tions  that  are  encountered. 


2.  Syntax.  Syntax  is  used  to  determine  the  validity 
of  a  word  sequence  using  a  set  of  grammatical  rules. 

3.  Semantics.  Semantic  information  is  used  to  determine 
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Pig.  19.  Block  Diagram  of  a  Speech  Understanding  System 


the  meaning  of  a  sequence  of  words. 

4.  Pragmatics.  Pragmatics  is  the  knowledge  of  the 
context  of  the  possible  tasks,  and  what  has  been 
said  previously. 

5.  Prosody.  Prosodic  information  involves  the  inter¬ 
pretation  of  stress  and  intonation  in  terms  of  the 
syntactic  and  semantic  structure  of  a  word  sequence. 

The  purpose  of  this  appendix  is  to  suggest  a  general 
approach  for  developing  a  virtually  non-restricted  speech 
understanding  system.  This  is  accomplished  by  recommending 
a  possible  word  sequence  hypothesizer,  and  by  suggesting  a 
method  for  incorporating  the  above  information  required  by 
the  sentence  analyzer.  Before  an  approach  can  be  established 
the  functional  requirements,  and  possible  restrictions  that 
can  be  employed  in  realizing  a  general  SUS,  must  be  discussed 


Functional  Requirements  and  Possible  Restrictions 

Present  speech  understanding  systems  employ  many  differ¬ 
ent  methods  to  recognize  and  interpret  speech.  To  achieve 
practical  performance  goals,  all  of  these  systems  impose 
severe  restrictions.  Although  most  of  these  restrictions 


must  be  applied  to  some  degree,  they  presently  limit  the  use¬ 
fulness  of  SUS's  to  very  few  applications.  A  discussion  of 
each  of  the  major  restrictions  is  given  below. 


1.  Mode  of  Speech 

This  restriction  concerns  how  close  the  SUS  is  to  recog- 


nizing  natural  conversational  speech.  Some  systems  can  only 
recognize  isolated  words,  i.e.  the  speaker  must  pause  between 
each  word  in  a  sentence.  Other  systems  are  capable  of  recog¬ 
nizing  connected  speech,  but  require  the  speaker  to  be  trained 
to  speak  in  an  unnatural  manner. 

2.  Speaker  Variability 

Speaker  variability  refers  to  the  dialect,  contextual, 

» 

and  acoustic  variations  among  speakers.  Dialectic  variability 
occurs  from  the  differences  in  the  pronunciation  of  words 
among  speakers.  Contextual  variability  involves  changes  in 
the  pronunciation  due  to  the  combinations  of  words.  Acoustic 
variability  results  from  vocal  tract  differences  among  speak¬ 
ers.  Any  or  all  these  types  of  variability  may  occur  when 
the  speaker  is  changed.  An  ideal  system's  performance  would 
not  be  effected  by  speakers  that  have  diverse  speaker  charac¬ 
teristics.  However  present  systems  may  be  adversely  affected 
by  changing  speakers.  It  should  be  noted  that  people  often 
have  trouble  understanding  others  with  different  speech  char¬ 
acteristics. 

A  system's  speaker  variability  is  based  on  the  variety 
of  speakers  allowed,  and  on  whether  or  not  the  system,  or 
speaker,  must  be  trained.  Some  systems  only  allow  speakers 
which  have  certain  characteristics.  For  example,  the  system 
may  require  the  speaker  to  be  male,  or  require  the  speaker 
to  have  a  particular  dialect.  Other  systems  can  only  recognize 


speech  from  certain  speakers,  and  tune  the  system  to  a  new 
speaker  by  requiring  the  speaker  to  repeat  a  set  of  training 
sentences.  Perhaps  the  best  method  of  adapting  the  system 
to  a  speaker  is  to  use  a  dynamic  tuning  algorithm,  where  the 
system  automatically  tunes  itself  to  a  new  speaker  (Ref  13) . 

3.  Language 

Assuming  that  English  is  the  required  language,  the  sys¬ 
tem  language  can  be  natural  English,  a  language  similar  to 
English,  Or  a  command  language.  An  English-like  language 
places  restrictions  on  the  structure  of  the  sentences,  or 
number  of  different  sentences  allowed,  and  usually  depends 
on  the  task  being  performed.  A  command  language  is  highly 
dependent  on  the  task  being  performed,  and  consists  of  a 
very  small  set  of  phrases. 

4.  Environment 

The  environment  of  a  SUS  is  usually  dependent  on  the 
task  to  be  performed.  For  example  a  system  to  be  used  in  an 
office  would  require  less  tolerance  to  noise  than  a  system 
to  be  used  in  a  jet  aircraft.  A  reasonable  environment  for 
an  initial  SUS  would  be  a  computer  room,  however,  some  re¬ 
searchers  believe  that  a  sound  proof  room  should  be  used  for 
initial  system  development. 

5.  Vocabulary  Size 


Ideally  the  vocabulary  of  a  SUS  should  be  unlimited,  how¬ 
ever  the  size  of  the  vocabulary  effects  performance  drastically 


A  reasonable  vocabulary  would  probably  consist  of  3,000  to 
10,000  words.  Other  than  limiting  the  size  of  the  vocabu¬ 
lary,  the  vocabulary  can  be  restricted  by  selecting  words 
for  phonetic  discriminability,  i.e.  selecting  words  to  make 
the  task  of  matching  acoustic  segments  to  the  words  easier. 

If  the  vocabulary  is  selected  with  this  in  mind,  the  ability 
of  the  system  to  adapt  to  changes  in  its  task  would  be  less- 
ened. 

6.  Average  Branching  Factor 

The  average  branching  factor  (ABF)  is  the  average  number 
of  words  that  can  follow  an  initial  sequence  of  words  as 
defined  by  the  grammar.  Usually  a  system's  performance  in¬ 
creases  as  the  ABF  decreases,  as  shown  in  Figure  20  for  the 
HWIM,  SDC,  Hearsay  II,  and  Harpy  speech  understanding  sys¬ 
tems  (Ref  3; 243).  The  reason  for  this  is  that  lower  ABFs 
substantially  limit  the  number  of  possible  words  that  can 
.occur  after  an  initial  sequence.  Thus  the  number  of  searches 
that  must  be  performed  to  determine  the  next  word  in  a  se¬ 
quence  is  reduced.  Since  this  factor  can  only  be  reduced  by 
placing  additional  constraints  on  the  grammar,  it  is  perhaps 
the  best  measure  of  the  constraints  placed  on  a  system. 

7.  Task  Orientation 

This  restriction  concerns  the  complexity  of  the  tasks 
performed  by  the  system,  and  the  dependence  of  a  system  on 
the  task.  The  complexity  can  range  from  simple  tasks,  such 
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as  moving  a  small  number  of  objects  in  a  simple  environment, 
to  very  complex  tasks  which  may  involve  a  large  number  of 
simpler  tasks .  The  degree  of  dependence  of  a  SUS  on  the 
task  can  dramatically  effect  the  performance  of  the  system, 
and  is  closely  related  to  the  other  restrictions  placed  on 
the  system.  For  example,  if  a  simple  task,  such  as  moving 
blocks,  is  to  be  performed,  the  language  necessary  would 
only  have  to  consist  of  a  few  simple  sentences,  and  the  vo¬ 
cabulary  would  only  have  to  include  a  few  words.  A  low  ABF 
is  a  sign  that  the  system  is  task  dependent. 

Taking  into  consideration  the  above  restrictions,  and 
other  factors,  the  functional  requirements  for  a  reasonable 
SUS  are  given  below. 

1.  The  mode  of  speech  should  be  natural  conversational 
speech. 

2.  The  system  should  be  capable  of  accurately  recog¬ 
nizing  the  speech  of  a  number  of  various  speakers, 
i.e.  speakers  of  both  sexes,  and  speakers  with 
different  dialects. 

3.  The  language  should  be  natural  English. 

4.  The  system  should  be  capable  of  handling  a  large 
number  of  different  tasks,  or  should  be  capable  of 
being  easily  changed  to  perform  various  tasks. 

5.  The  size  of  the  vocabulary  should  be  approximately 
3,000  words.  A  system  with  a  vocabulary  of  this 
size  will  be  capable  of  performing  a  large  range  of 
tasks. 

6.  The  system  should  operate  efficiently  in  environ¬ 
ments  with  noise  levels  equivalent  to  those  of  the 
intended  applications. 

7.  The  system  should  have  accuracy  rates  that  are  very 
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high.  The  accuracy  should  be  based  on  the  percen¬ 
tage  of  sentences  interpreted  correctly,  since  many 
errors  may  not  effect  the  task  to  be  performed.  If 
a  rejection  criterion  is  used,  the  number  of  rejec¬ 
tions  should  be  small. 

8.  The  system  should  operate  in  real  time. 

9.  The  system  should  be  flexible  in  that  it  should  be 
able  to  adapt  changes  in  the  task,  and  small  varia¬ 
tions  in  the  algorithms  employed. 

10.  The  system  should  be  reliable  in  terms  of  the  mean 
time  between  failures. 

11.  The  system  should  be  able  to  determine  when  one  of 
its  components  is  not  functioning  properly.  It 
would  also  be  desirable  to  be  able  to  bypass  mal¬ 
functioning  components . 

At  present  no  systems  meet  these  requirements.  To  fur¬ 
ther  complicate  the  development  of  a  general-purpose  SUS,  is 
the  fact  that  the  trade-offs  involved  among  these  require¬ 
ments  make  it  difficult  to  compare  the  different  architectures 
of  present  SUS's.  For  example,  a  system  with  a  high  task 
dependence  usually  obtains  more  accurate  results,  and  oper¬ 
ates  much  faster  than  a  system  with  less  dependence  on  the 
task  being  performed.  Therefore  it  would  appear  that  the 
faster  and  more  accurate  system  is  better,  which  may  not  be 
true.  Obviously  there  are  many  practical  applications  for 
SUS's  which  do  not  meet  the  above  requirements,  and  place 
severe  restrictions  on  the  task?  however,  there  are  not  any 
existing  speech  understanding  system's  other  than  ones  in  the 
experimental  stage. 
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Recommended  Approach 

The  purpose  of  this  section  is  to  present  an  approach 
for  developing  a  general  speech  understanding  system  which 
meets  the  functional  requirements  given  in  the  preceeding 
section.  This  approach  is  designed  to  provide  many  parallel 
areas  of  research  that  can  be  considered  independently.  In 
fact  it  is  strongly  recommended  that  most  of  the  areas  be 
researched  simultaneously.  The  method  for  developing  an  SUS 
should  be  based  on  the  application  of  restrictions  in  a  log¬ 
ical  manner  that  will  eventually  lead  to  a  virtually  unre¬ 
stricted  SUS.  The  7  research  steps  that  will  be  presented 
to  achieve  this  goal  are  shown  below. 

1.  Development  of  a  Feature  Extraction  Process. 

2.  Development  of  an  isolated  word  recognition  algorithm. 

3.  Development  of  a  connected  speech  algorithm. 

4.  Incorporation  of  phonetic  knowledge. 

5.  Incorporation  of  syntactic  and  semantic  information. 

6.  Incorporation  of  pragmatics  and  prosody. 

7.  Development  of  a  SUS  computer  architecture. 

Although  the  above  steps  suggest  a  sequential  approach, 

it  is  recommended  that  these  steps  be  independently  researched 
and  developed.  The  algorithms  suggested  in  the  following 
discussion  of  each  step  are  designed  to  impose  the  least  num¬ 
ber  of  restrict  is  necessary  to  realize  a  general  SUS. 


1.  Development  of  a  Feature  Extraction  Process 


There  are  basically  two  type  of  feature  extraction  al¬ 
gorithms:  word  based,  and  phoneme  based  algorithms.  Word 
based  algorithms  use  the  actual  words  as  the  features,  and 
attempt  to  identify  to  the  word  that  was  uttered  directly 
from  the  speech.  Because  of  memory  and  other  constraints, 
these  algorithms  severely  limit  the  size  of  the  vocabulary, 
and  tend  to  limit  the  flexibility  of  the  entire  system. 

Phoneme  based  systems,  on  the  other  hand,  potentially  do  not 
place  such  restrictions  on  the  system,  and  therefore  provide 
much  greater  flexibility.  Also,  more  flexibility  is  obtained 
by  these  systems  since  they  can  be  designed  completely  inde¬ 
pendent  of  the  other  processes  required  to  understand  speech. 
The  word  recognition  algorithm  developed  in  this  thesis  veri¬ 
fies  this  fact  because  it  can  be  used  with  any  feature  ex¬ 
traction  process  that  outputs  strings  of  phoneme  guesses. 

The  acoustic  process  developed  by  Karl  Seelandt  appears  to 
represent  the  type  of  algorithm  that  should  be  used  for  ex¬ 
tracting  phonemes. 

The  only  restrictions  that  must  be  overcome  in  Seelandt' s 
algorithm  are  the  "speaker  variability"  and  "environment" 
restrictions  discussed  earlier.  Initial  results  obtained 
from  this  algorithm  indicate  that  future  improvements  of  this 
process  may  eventually  overcome  these  restrictions.  Currently, 
several  researchers  are  attempting  to  make  these  improvements, 
and  continued  research  in  this  area  is  recommended. 
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As  shown  in  this  thesis,  the  accuracy  of  the  acoustic 
process  does  not  have  to  be  as  high  as  might  be  expected  to 
determine  the  words  spoken.  Because  of  this  fact,  and  be¬ 
cause  perfect  feature  extraction  is  virtually  impossible, 
the  feature  extraction  research  should  be  conducted  in  par¬ 
allel  with  the  remaining  areas  of  research  to  be  discussed 
in  this  section. 

2.  Development  of  an  Isolated  Word  Recognition  Algorithm 

Before  attempting  to  recognize  continuous  speech,  it  is 
logical  to  first  develop  an  isolated  word  recognition  algo¬ 
rithm.  Unlike  the  previous  step,  an  approximate  accuracy 
goal  for  this  algorithm  can  be  established  as  the  accuracy 
obtainable  by  humans  for  isolated  word  recognition.  The 
achievement  of  this  accuracy  will  be  dependent  on  the  deci¬ 
sion  scheme  used,  and  on  the  accuracy  of  the  acoustic  analy¬ 
zer. 

The  word  recognition  algorithm  developed  in  this  thesis 
is  capable  of  achieving  reasonable  accuracy  for  a  small  voca 
bulary  despite  the  inadequacy  of  the  feature  extraction  algo 
rithm.  Both  of  these  algorithms  must  be  improved  consider¬ 
ably  to  obtain  the  same  accuracy  obtained  by  humans.  Since 
the  only  restriction  that  should  be  placed  on  the  word  recog 
nition  algorithm  is  the  size  of  the  vocabulary,  it  is  recom¬ 
mended  that  the  initial  vocabulary  be  small,  and  increased 
slowly  as  the  recognition  accuracy  increases. 
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The  recognition  algorithm  presented  in  Chapter  4  offers 
a  number  of  advantages  over  existing  algorithms  used  to 
determine  the  word  spoken.  One  advantage  not  previously 
discussed  is  that  the  number  of  operations  required  to  deter¬ 
mine  the  word  spoken  is  equal  to  the  number  of  words  in  the 
vocabulary  times  the  number  of  operations  to  score  one  word. 
Therefore  an  increase  in  the  vocabulary  size  does  not  signi¬ 
ficantly  effect  the  algorithm's  execution  time.  However, 
the  time  required  to  execute  the  algorithm  in  its  current 
state  is  extensive.  Since  it  will  be  necessary  to  continu¬ 
ally  experiment  with  variations  of  this  algorithm  to  improve 
it,  it  would  be  advantageous  to  minimize  the  time  required 
to  execute  the  algorithm,  'uzis  can  be  accomplished  by  opti¬ 
mizing  the  algorithm's  code,  separating  the  words  into  word 
classes  (as  discussed  in  Chapter  9) ,  and  by  developing  a 
specialized  computer  system  for  implementing  the  algorithm. 

Since  the  same  operations  are  performed  to  score  each 
word,  a  computer  architecture  can  easily  be  developed  which 
would  decrease  the  execution  time.  Figure  21  illustrates  a 
block  diagram  of  a  multiprocessor  computer  system  that  would 
score  a  number  of  words  in  parallel,  and  that  would  still 
allow  major  modifications  to  be  made  in  the  algorithm.  As 
shown,  the  acoustic  processor  outputs  the  features  to  each 
of  the  N  word  processors.  The  supervisor  then  assigns  each 
word  processor  a  word  from  the  vocabulary  to  score.  After  a 
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Fig.  21.  Block  Diagram  of  a  Possible  Computer  Architecture  for 
Implementing  the  Word  Recognition  Algorithm. 


word  processor  scores  a  word,  it  returns  the  word  score  to 
the  supervisor,  and  waits  for  the  next  word.  This  process 
is  repeated  until  all  the  words  have  been  scored.  When  this 
occurs  the  supervisor  then  determines  which  word  was  spoken 
based  on  the  word  scores  received  from  the  word  processors. 

It  must  be  understood  that  the  purpose  of  increasing 
the  recognition  algorithm's  efficiency  is  to  aid  the  researcher 
in  determining  what  modifications  should  be  made  to  the  algo¬ 
rithm,  and  not  to  achieve  real  time  performance. 

3.  Development  of  a  Connected  Speech  Algorithm 

Once  the  isolated  word  recognition  algorithm  obtains  a 
reasonable  accuracy,  this  algorithm  should  be  modified  to 
recognize  words  that  appear  in  connected  speech.  This  should 
be  accomplished  by  first  expanding  the  isolated  word  recogni¬ 
tion  algorithm  to  locate  words  in  a  word  string.  One  possible 
method  for  doing  this  involves  the  use  of  thresholds,  and  the 
concept  of  fuzzy  word  locations. 

Thresholds  can  be  psed  to  establish  potential  beginning 
and  end  points  for  words  appearing  in  a  word  string.  This 
can  be  accomplished  by  requiring  word's  to  have  scores  above 
a  predefined  threshold  before  they  are  considered.  Fuzzy 
word  locations  allow  a  word's  end  points  to  be  fuzzy,  i.e. 
the  exact  location  of  a  word  does  not  have  to  be  specified 
(Ref  16:302-303).  This  strategy  eliminates  the  possibility 
of  examining  duplicate  word  hypothesis  for  the  same  word  in 


similar  locations,  and  allows  word  phrases  to  be  formed  with¬ 
out  determining  the  exact  word  boundaries.  Also,  errors  and 
overlapping  that  occur  between  words  can  easily  be  overcome 
using  this  approach. 

When  reasonable  accuracies  are  obtained  for  determining 
the  possible  words,  and  locations  of  these  words,  in  a  word 
string;  an  algorithm  which  hypothesizes  word  sequences  should 

9 

be  developed.  An  outline  of  a  possible  word  sequence  hypoth- 
esizer,  based  on  fuzzy  word  locations  is  shown  in  Figure  22. 

As  an  example  of  the  algorithm,  consider  the  words  "one" 
through  "seven",  and  assume  that  the  possible  end  points  of 
these  words  in  a  word  string,  shown  in  Figure  23,  were  ob¬ 
tained  from  applying  step  1  of  the  algorithm.  Execution  of 
step  2  of  the  algorithm  will  then  produce  the  tree  shown  in 
Figure  24,  and  the  application  of  steps  3  through  6,  with  X 
equal  to  three,  will  produce  the  possible  sequences  given  in 
Figure  25. 

After  all  the  possible  word  sequences  are  produced,  each 
sequence  can  be  assigned  a  score  by  summing  the  fuzzy  word 
scores  of  each  word  in  the  sequence.  Many  of  the  hypothesized 
sequences  can  then  be  eliminated  based  on  a  predetermined 
threshold,  or  by  only  considering  the  top  choices.  Final 
sentence  scores  can  then  be  obtained  by  combining  the  word 
representations  and  word  statistics  (used  in  the  word  recog¬ 
nition  algorithm)  of  all  the  words  in  a  given  sequence  to 
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1.  Find  all  possible  word  matches  in  word  string 
using  thresholds  to  determine  end  points.  End 
points  should  be  chosen  to  minimize  the  word 
lengths . 

2.  Generate  a  tree  using  inorder  traversal  (Ref 
6:243)  based  on  the  beginning  vector  of  each 
word. 

3.  Form  a  sentence  by  traversing  the  tree  inorder, 
choosing  the  next  word  in  the  sentence  as  the 
first  word  satisfying  the  expression 

(Beginning  of  next  word)  >=  (end  of  last  word  -  X) 

and  repeat  until  no  more  words  satisfy  the  above 
expression.  The  variable  "X"  in  the  above  ex¬ 
pression  is  used  to  account  for  possible  over¬ 
lapping  of  words . 

4.  Eliminate  last  word  of  previous  sentence,  and 
repeat  step  3  until  the  following  expression 
is  true: 

(Beginning  of  next  word)  >=  (end  of  word  being 

replaced) 

5.  Repeat  step  4  by  eliminating  last  word  examined 
until 

(Beginning  of  next  word)  >=  (end  of  word  being 

examined) 

6.  Repeat  step  5  until  the  first  word  has  been 
examined . 


Fig.  22.  Possible  Algorithm  for  Generating 
Sentence  Hypotheses 
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Fig.  23.  Result  of  Applying  Step  1  of  Fig.  24 
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Fig.  23.  Result  of  Applying  Step  1  of  Fig.  24 
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SENTENCE  NO 


POSSIBLE  SENTENCES 


ONE- THREE-FIVE-SEVEN 

ONE-THREE-SIX 

ONE-FOUR-SIX 

ONE- FOUR- SEVEN 

TWO-FOUR-SEVEN 

TWO- FOUR-SEVEN 

TWO-FIVE-SEVEN 


Fig.  25.  Results  of  Applying  Steps  3  Through  6 
of  Fig.  24  with  X  Equal  to  Three 


form  a  phrase,  and  by  applying  a  modified  version  of  the 

isolated  word  recognition  algorithm  which  would  treat  the 

phrases  as  words.  The  phrase  with  the  highest  score  would 

be  the  one  chosen  as  the  sequence  actually  spoken. 

4.  Incorporation  of  Phonetic  Knowledge 

The  phonetic  knowledge  to  be  incorporated  at  this  stage, 

refers  only  to  the  phonetic  information  for  the  errors  that 

» 

occur  between  words  when  they  are  spoken  together.  Although 
this  information  can  be  incorporated  after  the  algorithm 
developed  in  the  previous  step  achieves  reasonable  accuracy, 
it  is  recommended  that  this  knowledge  be  incorporated  as 
soon  as  an  initial  connected  speech  algorithm  is  developed. 
There  are  two  major  reasons  for  this  approach:  1)  the  accur¬ 
acy  of  the  connected  speech  algorithm  may  be  significantly 
affected  by  the  errors  that  occur  between  words;  and  2)  ex¬ 
tensive  modifications  of  the  connected  speech  algorithm  may 
be  required  to  incorporate  this  knowledge. 

Phonetic  information  can  be  incorporated  by  implementing 
a  set  of  phonological  rules,  or  by  using  statistics  similar 
to  the  word  statistics  used  in  this  thesis.  For  the  reasons 
presented  in  Chapter  4,  statistics  for  the  errors  that  occur 
between  words  would  be  preferable;  however,  because  of  memory 
constraints,  and  difficulties  which  were  not  encountered 
when  collecting  isolated  word  statistics,  this  approach  may 
not  be  practical. 
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5.  Incorporation  of  Syntactic  and  Semantic  Information 

Only  the  following  restrictions  should  be  considered 
in  the  previous  steps: 

1.  mode  of  speech; 

2.  speaker  variability; 

3.  environment;  and 

4.  vocabulary  size. 

This  step  must  consider  the  remaining  restrictions  (language, 
ABF,  and  task  orientation) .  The  simplest  and  most  restricted 
method  for  incorporating  syntactic,  and  semantic  knowledge 
is  to  simply  limit  the  number  of  possible  sentences  that  can 
occur.  This  is  exactly  what  was  done  in  the  Harpy  SUS  devel¬ 
oped  at  Carnigie-Mellon  University  (Ref  10) .  The  Harpy  sys¬ 
tem  employs  a  state  transition  network  which  is  formed  from 
all  the  possible  pronunciations  of  a  set  of  allowable  sen¬ 
tences.  This  approach  was  taken  to  increase  the  SUS  perfor¬ 
mance,  and  to  overcome  the  lack  of  practical  decision  algo¬ 
rithms  capable  of  producing  reasonable  sentence  hypotheses. 
Obviously  the  Harpy  is  extremely  inflexible,  and  can  never 
be  used  as  the  basis  of  a  general  SUS. 

Many  researchers  claim  that  humans  must  also  rely  on 
severe  restrictions  at  this  level  to  understand  speech;  how¬ 
ever  the  author  firmly  believes  that  although  these  restric¬ 
tions  are  indeed  used  extensively  by  humans,  the  importance 
of  them  have  been  over-estimated.  Syntactic  and  semantic 
information  should  be  incorporated  without  placing  unnecessary 
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restrictions  on  the  system.  Restrictions  should  only  be 
used  to  aid  in  the  development  of  the  general  algorithms. 

For  example,  a  limited  vocabulary  could  be  used  to  limit 
the  time  required  to  experiment  with  the  algorithms  being 
developed,  but  the  algorithms  should  remain  independent  of 
the  exact  vocabulary  chosen. 

Before  semantics  is  considered,  a  procedure  which  parses 
an  error-free  sentence  using  only  syntactic  information 
should  be  developed.  Such  an  algorithm  has  been  developed 
by  Milne  (Ref  14) .  This  algorithm  should  then  be  modified 
to  determine  if  a  given  sentence  is  valid,  and  then  incor¬ 
porated  with  the  word  sequence  hypothesizer  to  eliminate 
invalid  word  sequences  as  they  occur,  rather  than  consider 
ing  them  further. 

As  the  syntactic  algorithms  are  developed,  semantic 
algorithms  which  determine  the  meaning  of  a  sentence  using 
syntax  and  the  meanings  of  words,  should  be  developed.  It 
may  be  advisable  to  develop  this  algorithm  by  initially  plac¬ 
ing  restrictions  on  the  task  to  be  performed,  and  then  by 
expanding  the  algorithm  to  handle  more  complicated  tasks; 
however  the  algorithm  should  remain  independent  of  the  spe¬ 
cific  semantics  involved.  Fuzzy  set  theory  may  prove  to  be 
invaluable  in  determining  which  task  (or  tasks)  is  to  be 
performed  when  the  action  required  is  unclear,  or  when  the 
t-sk  requires  the  system  to  make  complex  decisions.  After 


this  algorithm  is  developed,  it  should  be  modified  to  eli¬ 
minate  invalid  sentences  that  were  not  eliminated  by  the 
syntactic  algorithm,  and  also  incorporated  in  the  word 
sequence  hypothesizer . 

6.  Incorporation  of  Pragmatics  and  Prosodies 

Although  pragmatic  and  prosodic  knowledge  are  completely 
different,  both  of  these  knowledge  sources  can  be  used  to 
eliminate  ambiguities  that  arise  in  the  syntactic  and  seman¬ 
tic  algorithms  developed  in  the  previous  step.  This  informa¬ 
tion  can  also  be  used  to  increase  the  system's  performance. 
Since  this  information  is  not  vital  for  obtaining  a  reason¬ 
able  SUS,  it  should  not  be  employed  until  after  reasonable 
success  has  been  attained  by  the  SUS.  Implementation  of 
pragmatic  or  prosodic  knowledge  at  an  earlier  stage  of  devel¬ 
opment  will  complicate  the  generation  of  the  more  vital 
algorithms,  and  will  consume  needlessly  many  hours  of  the 
researchers  time. 

7.  Development  of  a  SUS  Computer  Architecture 

The  practical  implementation  of  the  necessary  algorithms 
for  speech  understanding  will  most  likely  require  a  special¬ 
ized  computer  architecture  to  obtain  real  time  performance. 
Although  standard  computer  systems  should  be  used  where  pos¬ 
sible  in  the  development  of  these  algorithms,  it  is  illogical 
to  constrain  a  SUS  to  operate  on  a  general-purpose  computer 
system. 

To  design  or  choose  an  appropriate  computer  system  for 


a  speech  understanding  system,  the  following  factors  must 
be  considered. 

1.  Performance.  The  architecture  chosen  must  be  capable  of 
achieving  real  time  operation.  The  algorithms  developed  for 
speech  recognition  must  be  designed  with  the  expectation 
that  only  reasonable  advancements  in  computer  technology 
will  be  made  in  the  near  future,  and  not  with  the  hope  that 
major  technological  breakthroughs  will  be  made. 

2.  Flexibility.  The  architecture  should  place  a  minimum 
number  of  constraints  on  the  speech  understanding  algorithms. 
The  system  should  remain  as  flexible  as  possible  to  allow 
the  algorithms  to  be  modified  easily.  This  aspect  of  the 
computer  system  is  exceptionally  important  since  a  flexible 
architecture  will  enable  researchers  to  use  the  system  to 
aid  in  the  design  of  the  required  algorithms. 

3.  Reliability.  The  computer  system  must  be  reliable. 
Excessively  complex  systems  tend  to  be  less  reliable  than 
simple  ones,  and  therefore  should  be  avoided. 

4.  Fault  Tolerance.  This  aspect  concerns  the  interaction 
of  the  computer  system's  components  to  identify  components 
that  are  malfunctioning,  and  considers  the  system's  ability 
to  distribute  the  functions  of  the  malfunctioning  component 
while  minimizing  the  effects  on  performance.  Fault  tolerance 
also  refers  to  how  well  a  system's  performance  and  accuracy 
degrades  as  different  components  malfunction. 

5.  Cost.  Obviously  the  cost  of  the  final  system  is  very 


important.  Although  every  attempt  should  be  made  to  mini¬ 
mize  the  cost,  it  should  be  anticipated  that  the  expense  of 
developing  an  initial  system  may  become  very  high.  There¬ 
fore,  before  any  attempt  is  made  to  implement  a  proposed 
architecture,  detailed  simulations  of  the  system  should  be 
accomplished . 

Before  attempting  to  design  or  choose  a  SUS  architec¬ 
ture,  the  Hearsay  II  SUS  developed  at  Carnegie-Mellon 
University  should  be  examined  (Refs  5,  11,  and  12) .  Consid¬ 
ering  the  above  factors,  the  Hearsay  II  undoubtably  repre¬ 
sents  the  most  noteworthy  achievement  of  a  SUS  architecture. 
When  examining  this  system,  emphasis  should  be  placed  on  its 
architecture,  and  not  on  the  methods  employed  to  solve  the 
speech  understanding  problem.  Because  of  the  complexity  of 
this  system,  it  will  not  be  discussed  in  this  appendix,  and 
the  reader  is  encouraged  to  see  the  references. 

RemgrKg 

The  approach  presented  in  this  appendix  is  intended  to 
provide  future  researchers  with  a  general  guideline  for 
developing  a  speech  understanding  system.  It  is  very  impor¬ 
tant  that  any  research  in  this  area,  other  than  feature 
extraction  research  (step  1) ,  be  performed  with  an  under¬ 
standing  of  the  entire  problem,  and  not  just  segments  of 
the  problem.  The  recommended  steps  are  designed  to  indicate 
a  number  of  parallel  res  rch  efforts  that  can  eventually 
be  combined  in  a  logical  manner  to  realize  a  general  SUS. 
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Computer  Program  Structure  Charts 

This  appendix  contains  structure  charts  for  program 
LEARNWORD  shown  in  Appendix  D.  The  format  of  the  charts 
that  appear  in  this  appendix  is  shown  in  Figure  26,  and  a 
general  presentation  of  structure  charts  can  be  found  in 
Structured  Analysis  written  by  Victor  Weinberg. 

The  main  purpose  of  this  appendix  is  to  illustrate  the 
physical  structure  of  program  LEARNWORD.  The  structure 
charts  that  follow  should  be  used  in  conjunction  with  the 
program's  internal  documentation  to  obtain  a  thorough  under¬ 
standing  of  the  program. 

It  should  be  noted  that  programs  LEARNWORD  and  OUTSTAT, 
shown  in  Appendix  D,  are  written  in  Rational  Data  Systems, 
Incorporated  Pascal,  and  implement  some  non-standard  func¬ 
tions.  The  remaining  programs  are  written  in  Data  General 
Corporation  Fortran  V. 


"FUNCTION" 


Reference  to  the  structure  chart 
for  this  module. 


•  Indicates  that  a  decision  must  be 
made  to  determine  which  module  (if 
any)  is  to  be  called. 


The  corresponding  procedure  within 
the  program. 


-^Indicates  that  these  modules  are 
called  repetitively  until  some 
criterion  is  satisfied. 


Fig.  26.  Legend  for  the  Structure  Charts 
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Collect 


Pig  28.  Structure  Chart  1.0:  Collect  Statistics  on  a  Speech  File 


Fig.  29.  Structure  Chart  2.0:  Determine  Word  Spoken. 
Procedure  "ISORECOG" 
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Fig.  31.  Structure  Chart  4.0:  Manually  Change  Word  Phoneme  Representation. 
Procedure  " CHNGEREP " 
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Fig.  33.  Structure  Chart  6.0:  Run  an  Automated  Procedure 
Procedure  "AUTOMAT" 
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Fig.  34.  Structure  Chart  6.1:  Store  Speech  Files. 
Procedure  "STORESPEECH" 


Fig.  35.  Structure  Chart  6.2:  Automatic  Word  Recognition. 
Procedure  "RECOGALL" 
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Fig.  38.  Structure  Chart  6.5:  Optimize  the  Word  Fuzzy  Variables. 
Procedure  "FUZZYOPT" 
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Fig.  39.  Structure  Chart  6.5.1:  Optimize  Word  Variables 


Chart  6.5.2:  Compute  Word  Recognition  Results 
"OPTRECOG" 


Chang 


Fig.  41.  Structure  Chart  6.6:  Change  Optimum  Fuzzy  Variables. 
Procedure  "CHNGECPT" 


NOTE  *  -  This  function  varies  depending  on  the  number  of  word  phonemes  and  vectors 
remaining.  Refer  to  procedures  WGE2SGE3,  WGE2SLE2,  WEQ1SGE3 ,  WEQ1SLE2 . 
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Structure  Chart  B.0:  Generate  Word  Statistics 
Procedure  "INITSTAT" 
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Fig.  44.  Structure  Chart  C.O:  Derive  an  Initial  Representation. 
Procedure  "AUTODET" 


all 


Fig.  45.  Structure  Chart  D.O:  Score  All  Stored  Files  for  a  Word 
Procedure  "WORDRECOG" 


PROGRAM  LEARNWORD  DOCUMENTATION 

WRITTEN  BY:  2LT  GERARD  J,  MONTGOMERY 
DATE:  15  JUNE  1962 


GENERAL  PROGRAM  DESCRIPTION 


THIS  PROGRAM  PROVIDES  THE  CAPABILITY  AND  SUPPORT  NECESSARY  TO 
PERFORM  ISOLATED  WORD  RECOGNITION  GIVEN  THE  OUTPUT  OF  AN  ACOUSTIC 
PROCESSOR.  THE  ONLY  FUNCTION  THAT  IS  NOT  PROVIDED  IN  THIS  PROGRAM, 

IS  A  FUNCTION  TO  OUTPUT  THE  STATISTICS  AND  RECOGNITION  RESULTS  COLLECTED 
IN  FILES  SFILE  AND  UFILE.  THIS  FUNCTION  IS  PROVIDED  BY  PROGRAM 
OUTSTAT.  THE  MAIN  FUNCTIONS  THAT  PROGRAM  LEARNWORD  PROVIDES 
INCLUDE: 

IA.  COLLECTING  STATISTICS  ON  A  GIVEN  SPEECH  FILE  :TO  BE  USED 
FOR  TRAINING). 

IB.  ADDING  A  NEW  WORD  TG  THE  VOCABULARY,  AND  INITIALIZING  THE 
NECESSARY  DATA  STRUCTURES  FOR  THE  WORD, 

2.  RECOGNIZING  THE  WORD  SPOKEN  IN  A  GIVEN  SPEECH  FILE. 

3.  DETERMINING  A  WORD'S  PHONEME  REPRESENTATION  FROM  THE 
SPEECH  FILES  USED  FOR  TRAINING. 

4.  CHANGING  THE  PHONEME  REPRESENTATION  OF  A  WORD. 

5.  CHANGING  THE  VALUES  OF  THE  OVERALL  OR  WORD  FUZZY  VARIABLES. 

6.  RUNNING  AN  AUTOMATED  PROCEDURE  (REFER  TO  THE  DESCRIPTION  FOR 
PROCEDURE  AUTOMAT). 


NOTE:  A  DETAILED  PRESENTATION  OF  THE  PROGRAM'S  STRUCTURE  IS  PROVIDED 
BY  THE  STRUCTURE  CHARTS  SHOWN  IN  APPENDIX  1. 


MAIN  DATA  STRUCTURES 


INPUT/OUTPUT  DATA  FILES: 

0UT2  *  TEXT  FILE  USED  FOR  SPEECH  FILE  INPUT. 

RESULTS  -  OUTPUT  TEXT  FILE  FOR  STORING  RECOGNITION 
RESULTS. 

STATSOUT  -  OUTPUT  TEXT  FILE  FOR  OBTAINING  WORD  STATISTICS, 
AND  OTHER  OUTPUT  DATA, 


BATA  FILES: 


WFILE*  -  A  RANDOM-ACCESS  FILE  CONSISTING  OF  ONE 

RECORD  PER  WORD  IN  THE  VOCABULARY.  THE  PURPOSE 
OF  THIS  FILE  IS  TO  MAINTAIN  GENERAL  INFORMATION 
FOR  EACH  WORD  IN  THE  VOCABULARY.  EACH  RECORD 
CONTAINS  THE  FOLLOWING  DATA: 

VARIABLE  DESCRIPTION 


SPELLING  -  WORD  SPELLING 

PHREP  -  ARRAY  CONTAINING  WORD'S  PHONEME  REPRESENTATION. 
USTHR,USUBE,WSUBF,UINSE, 

WINSF ,WDELE , VDELF , 

WDELG ,WDCNE  ,WDCNF , 

WDCNG/WSFE  ,USFF , 

UCHVE,WCHVF, 

WSTATE,WSTATF,WSTATG, 

WTHR1E/WTHR1F  ,WTHR2E, 

WTHR2F  -  WORD'S  FUZZY  VARIABLES  (SEE 

THE  OVERALL  FUZZY  VARIABLES  DESCRIPTIONS 
FOR  AN  EXPLANATION  OF  A  SPECIFIC  VARIABLE). 


SF1LE*  -  A  RANDOM-ACCESS  FILE  CONSISTING  OF  ONE 

RECORD  PER  WORD  IN  THE  VOCABULARY.  THE  PURPOSE 
OF  THIS  FILE  IS  TO  MAINTAIN  STATISTICAL  DATA 
FOR  EACH  WORD  IN  THE  VOCABULARY.  EACH 
RECORD  CONTAINS  THE  FOLLOWING  DATA: 

VARIABLE  DESCRIPTION 


NOSPOKE  -  HUMBER  OF  TRAINING  FILES  USED  TO 
OBTAIN  THE  WORD  STATISTICS. 

NOATTEMPT  -  NUMBER  OF  RECOGNITION  ATTEMPTS  HADE 
FOR  THE  WORD. 

NOCORRECT  •  NUMBER  OF  TIMES  THE  WORD  WAS  CORRECTLY 
RECOGNIZED. 

NDEL  -  ARRAY  CONTAINING  THE  NUMBER  OF  TIMES 

EACH  WORD  PHONEME  WAS  DELETED  IN  THE 
TRAINING  SET, 

XSUBY  -  ARRAY  CONTAINING  A  SCORE  BASED  ON  THE 

NUMBER  OF  TIMES  EACH  PHONEME  WAS  SUBSTITUTED 
FOR  EACH  WORD  PHONEME. 

XINAFTZ  -  ARRAY  CONTAINING  A  SCORE  BASED  ON  THE 

NUMBER  OF  TIMES  EACH  PHONEME  WAS  INSERTED 
FOR  EACH  WORD  PHONEME. 

TOTSCORE  -  TOTAL  RECOGNITION  SCORE. 

TOTDIFF  -  TOTAL  DIFFERENCE  BETWEEN  THE  WORD'S  RECOG¬ 
NITION  SCORE  AND  THE  NEXT  HIGHEST  SCORE, 

(ONLY  CALCULATED  WHEN  WORD  WAS  CORRECTLY 
RECOGNIZED) 

HINDIFF  *  THE  MINIMUM  DIFFERENCE  BETWEEN  THE  WORD'S 

RECOGNITION  SCORE  AND  THE  NEXT  HIGHEST  SCORE. 
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(ONLY  CALCULATED  WHEN  WORD  WAS  CORRECTLY 
RECOGNIZED) 


SPECH*  - 


RECOG*  - 


FUZZYVAR 


OPTFUZZY 


A  RANDOM- ACCESS  FILE  CONSISTING  OF  ONE  OR  FIORE  RECORDS 
PER  SPEECH  FILE,  THE  PURPOSE  OF  THIS  FILE  IS  TO 
MAINTAIN  A  NUMBER  OF  SPEECH  FILES  THAT  CAN  BE  EASILY 
ACCESSED  BY  THE  PROGRAM  TO  INITIALIZE  WORD  STATISTICS  , 
AND  TO  DETERMINE  A  WORD'S  PHONEME  REPRESENTATION, 

EACH  RECORD  CONTAINS  THE  FOLLOWING  DATA: 

VARIABLE  DESCRIPTION 


SPELL 

CONTFLAG 


SPKR 

ICHPHONE 

ICHVALUE 

ISFACTOR 


IPHONE 


ISCORE 


IVERROR 


*  EXACT  SPELLING  OF  THE  WORD  SPOKEN. 

-  CONTINUATION  FLAG  INDICATING  WHETHER 

OR  NOT  THE  NEXT  RECORD  IS  A  CONTINUATION 
OF  THE  CURRENT  SPEECH  FILE. 

-  THE  SPEAKER  OF  THE  WORD. 

-  ARRAY  CONTAINING  THE  TOP  PHONEME  GUESSES 
FOR  EACH  SEGMENT  OF  SPEECH. 

-  ARRAY  CONTAINING  A  RELATIVE  WEIGHT  FOR  EACH 
PHONEME  GUESS. 

-  ARRAY  CONTAINING  A  RELATIVE  WEIGHT  FOR  EACH  VECTOR 
OF  SPEECH  INDICATING  THE  LEVEL  OF  CONFIDENCE 

THE  ACOUSTIC  PROCESSOR  HAD  IN  CHOOSING  THE  GIVEN 
PHONEMES  FOR  A  SEGMENT  OF  SPEECH. 

-  ARRAY  USED  BY  THE  RECOGNITION  ROUTINE  TO  INDICATE 
THE  WORD  PHONEME  CHOSEN  FOR  EACH  VECTOR  IN  THE 
SPEECH  FILE, 

-  ARRAY  USED  BY  THE  RECOGNITION  ROUTINE  TO  INDICATE 
A  SCORE  FOR  THE  WORD  PHONEME  CHOSEN 

FOR  EACH  SPEECH  VECTOi,, 

-  ARRAY  USED  BY  THE  RECOGNITION  ROUTINE  TO  INDICATE 
THE  ERROR  THAT  OCCURRED  FOR  EACH  SPEECH  VECTOR. 


IDENTICAL  TO  SPECH,  EXCEPT  THE  PURPOSE  OF  THIS  FILE  IS  TO 
MAINTAIN  A  NUMBER  OF  SPEECH  FILES  THAT  CAN  BE  EASILY 
ACCESSED  BY  THE  PROGRAM  FOR  OBTAINING  RECOGNITION  DATA. 


-  A  TEXT  FILE  CONTAINING  THE  OVERALL  FUZZY  VARIABLES  THAT 
ARE  USED  FOR  CALCULATING  THE  WORD  STATISTICS. 


-  A  TEXT  FILE  CONTAINING  INITIAL  FUZZY  VARIABLE  VALUES,  MINIMUM 
LIMITS,  AND  MAXIMUM  LIMITS,  THIS  FILE  IS  USED  BY  PROCEDURE 
FUZZYOPT  TO  INITIALIZE  THE  OPTIMUM  FUZZY  VARIABLES  OF  WORDS 
ADDED  TO  THE  VOCABULARY. 


FZOPT #  -  A  RANDOM-ACCESS  FILE  CONSISTING  OF  ONE  RECORD  PER  WORD. 

THE  PURPOSE  OF  THIS  FILE  IS  TO  MAINTAIN  EACH  WORD'S  OPTIMUM 
FUZZY  VARIABLES  AND  VARIABLE  LIMITS.  THIS  FILE  IS  USED  BY 
PROCEDURE  FUZZYOPT. 


NOTE: 

»  -  INDICATES  THAT  THE  ACTUAL  FILE  NAME  IS  PRECEDED  BY  THREE  INITIALS. 
FOR  EXAMPLE,  THE  FOLLOWING  FILES  ARE  CURRENTLY  BEING  USED  TO 
COLLECT  DATA  FOR  MULTIPLE  SPEAKERS: 

ALLWFILE 

ALLSFILE 

ALLSPECH 

ALLRECOG 

ALLFZOPT 


GLOBAL  CONSTANTS 


CONSTANT 


DESCRIPTION 


SPELLENGTH 

MAXPHONES 

MAXPHPLUS1 

NOPHONEHES 

NOGUESSES 

MAXNOUORDS 

RCRDLENGTH 

MAX  VECTORS 

NOTOAVG 


NOFUZZYVARS 

BECINWORD 


-  THE  MAXIMUM  NUMBER  OF  LETTERS  FOR  SPELLING  A  WORD. 

-  THE  MAXIMUM  NUMBER  OF  PHONEMES  IN  A  WORD'S 

PHONEME  REPRESENTATION. 

-  MAXPHONES  PLUS  1. 

-  THE  NUMBER  OF  PROTOTYPE  PHONEMES. 

-  THE  NUMBER  OF  GUESSES ( CHO I CES >  PROVIDED  BY  THE 
ACOUSTIC  PROCESSOR. 

-  THE  MAXIMUM  NUMBER  OF  WORDS  IN  THE  VOCABULARY. 

-  THE  NUMBER  OF  VECTORS  PER  RECORD  USED  IN  STORING 
SPEECH  FILES. 

-  THE  MAXIMUM  NUMBER  OF  VECTORS  OF  A  SPEECH  FILE  TO 
BE  IN  MEMORY  AT  ONE  TIME  (2  *  RCRDLENGTH). 

-  THE  NUMBER  OF  VECTORS  TO  BE  AVERAGED  BY  THE  RECOG¬ 
NITION  ROUTINE  TO  DETERMINE  IF  A  PHONEME  IS  A 
TRANSITION  BETWEEN  WORD  PHONEMES. 

-  TOTAL  NUMBER  OF  FUZZY  VARIABLES  USED  IN  THE  PROGRAM. 

-  THE  STARTING  VECTOR  OF  A  WORD  IN  A  SPEECH  FILE. 
(PROVIDED  FOR  THE  FUTURE  IMPLEMENTATION  OF  A  CONTINUOUS 
SPEECH  ALGORITHM) 


GLOBAL  VARIABLES 


VARIABLES  DESCRIPTION 


y D AT  A  -  ONE  RECORD  OF  DATA  STORED  IN  FILE  WFILE. 

WSTATS  -  ONE  RECORD  OF  DATA  STORED  IN  FILE  SFILE. 

IOUT  -  ONE  RECORD  OF  DATA  STORED  IN  FILES  SF'ECH  OR  RECOG. 

FLIMIT  -  ONE  RECORD  OF  DATA  STORED  IN  FILE  F20PT. 

OPTION  -  VARIABLE  USED  TO  INDICATE  THE  FUNCTION  TO  BE  PERFORRED. 

STOPFLAG  -  FLAG  USED  TO  STOP  PROGRAM. 

CHNGEFLAG  -  FLAG  USED  BY  PROCEDURE  AUTODET  TO  DETERMINE  WHEN  TO 

STOP. 

UORDSCORE  -  WORD  SCORE  OBTAINED  BY  PROCEDURE  SCOREWORD. 

DELCNT  -  NUMBER  OF  DELETIONS  OBTAINED  BY  PROCEDURE  SCOREWORD, 

DELW  *  TOTAL  DELETION  SCORE  OBTAINED  BY  PROCEDURE  SCOREWORD. 

SCORECOUNT  -  TOTAL  NUMBER  OF  VECTORS  IN  A  SPEECH  FILE  OBTAINED  BY 

PROCEDURE  SCOREWORD. 

INSRTSCORE  -  TOTAL  INSERTION  SCORE  OBTAINED  IN  PROCEDURE  SCOREWORD. 

PWORD  -  TOTAL  WORD  SCORE  EXCLUDING  THE  EFFECTS  OF  DELETIONS 

OBTAINED  BY  PROCEDURE  SCOREWORD, 

WORD  -  WORD  CURRENTLY  BEINC  OPERATED  ON. 

BEGINREC  *  BEGINING  RECORD  OF  A  SPEECHFILE. 

IRECORD  -  CURRENT  RECORD  BEINC  OPERATED  ON. 

ENDFLAG  FLAG  USED  TO  INDICATE  WHEN  A  PROCESS  HAS  BEEN 

COMPLETED. 

SPEAKER  -  THE  THREE  INITIALS  PECEEDING  THE  DATA  FILES. 

FILENAME  -  FILE  TITLE  OF  FILE  TO  BE  USED. 

ENDWQRD  -  VARIABLE  INDICATING  THE  CURRENT  END 

MARK  OF  A  SPEECH  FILE. 

UORDLENGTH  *  NUMBER  OF  PHONEMES  IN  A  WORD'S  PHONEME  REPRESENTATION. 

UNUHB /WORDNO  -  NUMBER  CORRESPONDING  TO  THE  WORD  BEING  OPERATED 

ON. 

WORDFLAG* UORDCNT  -  NUMBER  CORRESPONDING  TO  THE  NUMBER  OF  THE  SPEECH 

FILE  THAT  WAS  LAST  OPERATED  ON  BY  AN  AUTOMATED 
PROCEDURE  (RECOGALL). 

NOPHVECTORS  -  NO  OF  VECTORS  TO  BE  USED  IN  DETERMINING  A  WORD'S 

PHONEME  REPRESENTATION.  INDICATES  THE  NUMBER  OF  VECTORS 
PER  WORD  PHONEME. 

OPERATION  *  OPERATION  TO  BE  PERFORMED, 

OPERATE  -  ARRAY  USED  BY  THE  AUTOMATED  PROCEDURES  TO  INDICATE 

THE  WORDS  TO  BE  OPERATED  ON. 


FUZZY  VARIABLES 


VARIABLE  DESCRIPTION 


STHR 


-  DETERMINES  WHETHER  OR  NOT  A  SUBSTITUTION 


ERROR  OCCURRED, 

SUBE f SUB F  -  VARIES  THE  EFFECT  QF  A  SUBSTITUTION  ERROR 
ON  THE  WORD  SCORE. 

INSE  ,INSF  -  VARIES  THE  EFFECT  OF  AH  INSERTION  ERROR 
ON  THE  WORD  SCORE, 

DELE,DELF  -  VARIES  THE  EFFECT  OF  THE  STATISTICS  ON  THE 

NUP1BER  OF  TINES  A  WORD  PHONEME  HAS  BEEN  DELETED, 

DELC  -  WEIGHTS  THE  EFFECT  OF  THE  OVERALL  DELETION 

SCORE  VERSUS  PUORD. 

DCNE,DCNF  -  VARIES  THE  EFFECT  OF  THE  TOTAL  DELETION 
SCORE  ON  THE  OVERALL  WORD  SCORE. 

DCNC  -  WEIGHTS  THE  EFFECT  OF  DELW  VERSUS  DELCNT. 

SFE,SFF  -  VARIES  THE  EFFECT  OF  THE  SCALE  FACTOR 

(SFACTOR)  ON  THE  PHONEME  WEIGHT (CHVA L U E ) . 

CHVE,CHVF  -  VARIES  THE  EFFECTS  OF  THE  WEIGHTS  OF  THE 

GUESSES  PROVIDED  FOR  EACH  VECTOR  OF  SPEECH, 

STATE, STATF  -  VARIES  THE  EFFECT  OF  THE  STATISTICS  COLLECTED 
FOR  SUBSTITUTION  AND  INSERTION  ERRORS. 

STATG  -  WEIGHTS  THE  EFFECT  OF  THE  STATISTICS  VERSUS 

THE  PHONEME  SCORE. 

THR1E,THR1F, 

THR2E ,THR2F  -  USED  FOR  DETERMINING  TRANSITIONS  BETWEEN  WORD  PHONEMES 


NOTE:  REFER  TO  PROCEDURE  SCOREWORD  TO  DETERMINE  THE  EXACT  EFFECT 
OF  EACH  OF  THESE  VARIABLES. 


»«*««ft*«**««*««ft»««*«****»**ft«*********»X*K*«**«*»***«»»»«»«*«»tt««****«**} 


PROGRAM  LEARNWORD  (INPUT, OUTPUT, SPECH,WFILE ,SFILE ,0UT2, RESULTS, 
STATSOUT , FUZZY VAR ,OPTFUZZY ,FZQPT) ; 

CONST 

SPELLENGTH  =  20 ; 

NAXPHQNESs20; 

MAXPHPLUS1  =  21 ; 

N0PH0NENES-71 ; 

NOGUESSES- 5 ; 

HAXN0WQRDS-50; 

RCRDLENGTHMO  {RCRDLENGTH  MUST  BE  A  MULTIPLE  OF  TWENTY); 

HAXVECTORS’BO; 

N0T0AVG-5; 

BEG I NWORD - 1 ; 

NOFUZZYVARS-22; 

TYPE 

WORDDATA:RECORD 

SPELLING :STR1NGISPELLENGTH) ; 

PHREP : ARRAY! 1 , .MAXPHONES]  OF  INTEGER; 


WSTHR,USUBE,WSUBF,WINSE:REAL; 

UINSF,yDEL£,WDELF,UDELG:REAL; 

WDCNE,UDCNF,WDCNG:REAL; 

ySFE,USFF,UCHVE,UCHVF:REAL; 

HSTATE,WSTATF,WSTATG:REAL; 

UTHR1E,UTHR1F,UTHR2E,UTHR2F:REAL; 

END  UWORDDATA*)  ; 

«ORDSTATS=RECORD 

NOSPOKE ,NOATTEHPT ,NOCORRECT slNTEGER  ; 

NDELsARRAYCl . .HAXPHONES]  OF  INTEGER; 
XSUBY,XINAFTZ:ARRAY!l,.HAXPHONES,i..NOPHQNEI1ES]  OF  REAL 
TOTALSCORE:REAL; 

TOTDIFFsREAL; 

HINDIFFjREAL; 

END  UWORDSTATS*); 

STATSFIL=FILE  OF  UORDSTATS; 

ISTRING=RECORD 

SPELL : ST RING! SPELL ENGTH ] ; 

CONTFLAG :INTEGER ; 

SPKR:STRINGC10] ; 

ISCORE: ARRAY !1.. RCRDLENGTH]  OF  REAL; 

IPHONE: ARRAY! 1* .RCRDLENGTH]  OF  INTEGER; 

IVERROR:ARRAY!l . iRCRDLENGTH]  OF  STRINGE5]; 
ICHPH0NE:ARRAY!1.. NOGUESSES, 1. .RCRDLENGTH]  OF  INTEGER; 
ICHVALUEsARRAYEl,. NOGUESSES, 1,. RCRDLENGTH]  OF  REAL; 
ISFACTOR:ARRAY[l.. RCRDLENGTH]  OF  REAL 
END  UISTRING*); 


WORDSTRING=FILE  OF  ISTRING; 
WORDFILE=  FILE  OF  UORDDATA; 


FUZZ YLI HI TS= RECORD 

FUZNAHE : ARRAY! 1  • • NOFUZZ Y VARS]  OF  STRINGE5]; 

FUZVAR ,NINLIRIT ,HAXLI HIT : ARRAY! 1 • . NOFUZZ Y VARS]  OF  REAL; 
RECSCORE:REAL; 

ENDUFUZZYLIHITS«); 

FUZOPT  =  PILE  OF  FUZZYLIHITS; 


VAR 


OPTIONsINTEGER; 
STOPFLAG {BOOLEAN; 
CHNGEFLAG  ^BOOLEAN; 

WORDSCOREjREAL; 
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DELCHT : INTEGER  $ 

DELW:REAL; 

SCORECOUNT iINTEGER  ; 

1NSRTSCORE ,PWORD:REAL  {WORD  SCORE}; 

UDATA:UORDDATA; 

USTATS:UORDSTATS; 

SFILEsSTATSFIL; 

SPECH:WORDSTRING; 

UFILE :U0RDF1LE ; 

IOUT  s ISTRINQ ; 

FZOPT:FUZOPT ; 

FLIHIT:FUZZYIIHITS; 

BEGINREC : INTEGER ; 

IRECORDsINTEGER; 

UORD:STRING!SPELLENGTH3 ; 

ENDFLAG:BOQLEAN; 


RESULTSsTEXT; 
STATSOUT:TEXT; 
FUZZYVARsTEXT ; 
0UT2:TEXT ; 
OPTFUZZY  sTEXT ; 


SPEAKER: STRING [33 ; 

FILENAME : STRING! 22 ] ; 

ENDUORD:INTEGER; 

UORDLENGTH:INTEGER; 

UNUMB,UORDNO: INTEGER; 

UORDFL AC ,UORDCNT:INTEGER; 

NOPHVECTORS: INTEGER; 

OPERATION:STRING!&3 ; 

OPERATE: ARRAY! 1 < .MAXNOWORDS3  OF  STRINGL33; 


STHR,SUBE,SUBF:REAL; 

1NSE,INSF:REAL; 

OELE/DELF , DEL G: REAL; 
DCNE,DCNF,DCNG:REAL; 
SFE,SFF:REAL; 

CHVE,CHVF:REAL; 

STATE,  STATF,STATG:REAL; 
THR1E,THR1F ,THR2E,THR2F:REAL; 


FUNCTION  POWER! ARC, PUR: REAL) :REAL;  FORWARD; 

PROCEDURE  1NITFILE;  FORWARD; 

PROCEDURE  GETFUZZYVARS;  FORWARD; 

PROCEDURE  GETUORDVARS;  FORWARD; 

SEGMENT  PROCEDURE  INITSTAT(N0WORD:INTEGER);  FORWARD; 
SEGMENT  PROCEDURE  W1NIT1;  FORWARD; 

SEGMENT  PROCEDURE  STOREUORD;  FORWARD; 
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<  $1  SCORE. PA  > 

{  $1  UPDATE. PA  ) 

{  ♦!  ISORECOG, PA  > 

<  «I  CHNGEREP.PA  > 

{  $1  AUTODET.PA  > 

(  $1  CHHGEFUZ.PA  > 

{  $1  AUTOMAT. PA  > 


{*#*##«*##*#«#*##*#*#«*#*##**##*##*##*»**#*####*»»#*»#**##****####«##»* 


PROCEDURE  WINIT1  DESCRIPTION 


THIS  PROCEDURE  ADDS  A  NEW  SPEECH  FILE  TO  THE  EXISTING  TRAINING 
FILES  STORED  IN  FILE  SPECH  FOR  A  CIVEN  WORD.  IF  THE  WORD  IS  NOT 
IN  THE  CURRENT  VOCABULARY,  THIS  PROCEDURE  ADDS  THE  WORD  TO  THE 
VOCABULARY,  AND  INITIALIZES  THE  NECESSARY  DATA  STRUCTURES  FOR  THE 
WORD. 

THE  USER  IS  REQUIRED  TO  ENTER  THE  SPELLING  OF  THE  WORD,  AND 
THE  NAHE  OF  THE  SPEECH  FILE  TO  BE  STORED.  ALSO,  IF 
THE  WORD  IS  NOT  IN  THE  CURRENT  VOCABULARY,  THE  USER  HUST  ENTER  AN 
INITIAL  PHONEME  REPRESENTATION  FOR  THE  WORD  (THIS  REPRESENTATION  DOES 
NOT  HAVE  TO  BE  VALID). 

SINCE  IT  IS  VITAL  THAT  THE  WORD'S  SPELLING  GIVEN  IN  THE  SPEECH 
FILE  IS  ACCURATE,  A  CHECK  IS  PERFORMED  TO  ASSURE  THAT  THIS  SPELLING  AGREES 
WITH  THE  SPELLING  INITIALLY  ENTERED  BY  THE  USER. 

»**«*»«»*****«*««««»«***«»*»***»**«**«**»«*««**«*******#*****»»**»*****»»*} 


SEGMENT  PROCEDURE  WINIT1? 

VAR 

W  ^  S : INTEGER ; 

I , J ,1NT  sINTEGER; 

TEHPWORD : STRING CSPEL LENGTH  1 ; 
ANSWER :STRING[3I ; 

ANS : I NTEGER ; 

FLAGjBOOLEAN; 

NAHE :STRINGC201 ; 


BEGIN («INIT1«) 


ENDFLACs-FALSE; 

WRITELN; 


•Ciet  »ord  to  be  learned  and  detenine  if  it 
is  in  the  current  vocabulary,  ) 

WRITELN  ('INPUT  EXACT  SPELLING  OF  WORD  TO  BE  LEARNED.  '); 

WRITE  ('WORD  ‘  '); 

READLN  (WORD); 

FOR  I := (LENGTH* WORD) )  TO  (SPELLENGTH-1 )  DO 
BEG1N(*F0R«) 

WORD ::  CON  CAT  (WORD/  '); 

ENDUFOR*) ; 

WRITELN  ; 

RESET(SFILE); 

RESET  (UFILE) ; 

TENPWORD:5 ‘  '; 
y ORDNO :  =  0 ; 

WHILE  (NOT  EOF ( WFI LE )) AND (WORD < )TEHPWORD)  DO 
BEGIN  (*WHILE«) 

SEEK (UFILE ;UORDNO) ; 

GET(WFILE) ; 

WDATA:sWFILE*; 

GET(WFILE); 

TENPWORD : 5 UD AT A, SPELLING; 

WORDNO:5WORDNO+  1 
END  UWHILE*); 

Uf  lord  does  not  exist,  and  user  »ants  to  add  it, 
initialize  the  word's  data  structures.) 

IF  (WORDOTEHPUORD)  THEN 
BEGIN  ( *1F* ) 

WRITELN; 

WRITELN  ('WORD  DOES ;N0T  EXIST.  IF  YOU  WOULD  LIKE  TO  ADD  THIS  '); 
WRITELN  ('WORD  TO  THE  VOCABULARY,  ENTER  "YES”,  ELSE  ENTER  "NO".') 
WRITE  ('ANSWER  *  '); 

READLN  (ANSWER); 

IF  ANSUERO'YES'  THEN 
BEGIN  (*IF») 

ENDFLACs-TRUE; 

END  ( *  IF* ) 

ELSE 

BEGIN  UELSE*) 


{Initialize  the  «ord  fuzzy  variables  by  setting  then 
equal  to  the  overall  variable  values,) 

WITH  WDATA  DO 
BEGIN  («WITH») 

SPELLING: ‘WORD; 

GETFU2ZYVARS; 

WSTHR:*STHR; 

WSUBE:*SUBE; 
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USUBF :  =  SUB  F ; 

UINSE:=INSE; 
y INSF :  =  I NSF ; 

WDELE:=DELE; 

UDELF:=DELF; 

WDELCi-DELt; 

UDCNE:=DCHE; 

UDCNF:=DCNF; 

UDCNGi=DCNG; 

USFE::SFE; 

WSFFi*SFF; 

UCHVE:=CHVE; 

UCHVFisCHVF; 

USTATE:=STAT£; 

WSTATFs=STATF; 

USTATG:=STATG ; 
yTHR 1 E :  =  THR 1 E ; 

UTHR1F:STHR1F; 

UTHR2E:*THR2E; 

UTHR2F:STHR2F; 

URITELN; 

URITELN ; 

(Get  the  word's  phoneee  representation.  Note  that 
the  representation  does  not  have  to  be  valid. > 

URITELN  (‘ENTER  THE  NUMERIC  VALUE  (01  TO  71)  FOR  '); 
URITELN  ('EACH  PHONEME  IN  THE  UORD,  UP  TO  A  MAXIMUM  OF  '); 
URITE  (MAXPHONES); 

URITELN  ('  PHONEMES.  AFTER  ALL  UORD  PHONEMES  HAVE  BEEN  ') 
URITELN  CENTERED,  ENTER  "00”.  >; 

U :  =  1 ; 

ANS ;  =  7 1  * 

UHILE  (W(sHAXPHONES)AND(ANS< >00)  DO 
BEGIN  («UHILE«) 

URITELN; 

URITE  CUORD  PHONEME  ',U:2,'  *  '); 

READLN  (ANS); 

IF  (00( : ANS)AND( ANS(s71)  THEN 
BEGIN  (»IF») 

PHREPtWI :*ANS; 

U:--W+l 
END  <«IF«) 

ELSE 

BEGIN  (»ELSE»> 

URITELN; 

URITELN  ('PHONEME  NOT  VALID,  REENTER'); 

END  («ELSE») 

END  (ftUHILE*) ; 


IF  PHREPUI  «  00  THEN 


ENDFLAG !  =  TRUE 
ELSE 

BEGIN  (#ELSE») 

IF  N<MAXPHONES  THEN 
BEGIN  ( *  I F* ) 

FOR  J:=  U  TO  HAXPHONES  DO 
PHREPL  J  3 : - 00 
END  UFOR#); 

END  < « I F  * ) ; 

IF  U0RDN0:0  THEN 
BEGINUIF*) 

APPEND(WFILE) $ 

END  <*IF«> 

ELSE 

BEGIN ( «ELSE* ) 

SEEK  CWFILE , (UORDNO-1 ) ) ; 
GET(WFILE) ; 

END  <  »ELSE* ) ; 

WFILE* :  =  U  D  A  T A ; 

PUT  (UF1LE) ; 


Unitialize  the  nord  statistics.) 

WITH  WSTATS  DO 
BEGIN  (»WITH») 

N0SP0KE:=0; 

N0C0RRECT:=0; 

N0ATTEHPT:*0; 

TOTDIFFj'O.O; 

HINDIFF:=1.0? 

FOR  1 5s  1  TO  HAXPHONES  DO 
BEGIN  (*FOR») 

NDELCns*0? 

FOR  J:=  1  TO  NOPHONEHES  DO 
BEGIN  ( * F 0 R « ) 

XINAFTZCI ;J J ::0.0; 
XSUBYCI , J3:  =  0.0 
END  <«FOR») 

END  <»F0R*>; 

TOTALSCORE:sO.O; 

IF  W0RDN0=0  THEN 
BEGIN ( * I F* ) 

APPEND(SFILE) ; 

END  (*IF») 

ELSE 

BEGINUELSE*) 

SEEK ( SF I LE f (UORDNO-1 ) ) \ 
GET(SFILE) ; 

END (*ELSE«) ; 

SFILE* :*WSTAT$; 

PUT  (SFILE) 

END  (»UITH*) 


END  (»ELSE») 


END  (*UITH#> 

END  <*ELSE»> 

END  <»IF»); 

IF  ENDFLAG  =  FALSE  THEN 
BEGIN  (*IF*> 

(Get  speech  file  to  be  used  for  training,  and  assure 
that  the  speaker  and  word  spelling  are  valid*} 


RESET ( SPECH ) ; 

URITELN; 

URITELN( ' INPUT  NANE  OF  SPEECHFILE' ) ; 

URITE( ' FILENAME  *  '); 

RE ADLN ( NAME ) ; 

URITELN; 

FLAG::TRUE; 

INT:-0; 

J:-l; 

UHILE  JOLENGTH(NAHE)  DO 
BEGIN  («UHILE«) 

IF  NAHEC JI : ' : '  THEN 
BEGIN  (*IF») 

INT:S J; 

END(*IF») ; 

END ( *WHI LE* ) ; 

FOR  I s=l  TO  3  DO 
BEGINUFOR*) 

IF  SPEAKERS  1 3  < >  NANECItlNT]  THEN 
FLAG::FALSE; 

END  («F0R«); 

IF  (( FLAG* FALSE) AND (SPEAKER<  >' ALL' ) )  THEN 
BEGIN ( *IF» ) 

URITELN; 

URITELN; 

URITELNt ' SPEAKER'S  INITIALS  DO  NOT  AGREE  UITH  THE  '); 
URITELNC FIRST  3  LETTERS  OF  THE  SPEECHFILE.  DO  YOU  ') 
URITELN < ' UISH  TO  CONTINUE  ANYUAY!  ENTER  "YES"  TO’); 
URITELNC CONTINUE,  ELSE  ENTER  "NO".'); 

URITE( ' ANSWER  =  '); 

READLNt ANSUER) ; 

URITELN; 

IF  ANSUERO  'YES'  THEN 
BEGINUIF*) 

ENDFLAG:=TRUE; 

END(«IF»); 

END  (»IF»); 

IF  ENDFLAG=FALSE  THEN 
BEGIN ( *IF* ) 

FILETITLE(QUT2,NAHE) ; 

RESET  (0UT2); 

UITH  IOUT  DO 
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BEGIN  OWITH*) 

READLN(QUT2, SPELL); 

FOR  Is=  (LENGTH(SPELL) )  TO  (SPELLENGTH-1)  DO 
BEGIN  OFOR*) 

SPELL:  =  CONC AT (SPELL , '  *); 

END  OFOR*) ; 

WORD:=WDATA. SPELLING; 

IF  SPELLOUORD  THEN 
BEGIN  OIF*) 

FLAGs-FALSE; 

REPEAT 
WRITELH ; 

WRITELN  (‘THE  SPELLING  OF  THE  WORD  GIVEN  ABOVE  DOES'); 
WRITELN  ('NOT  AGREE  WITH  THE  SPELLING  GIVEN  IN  THE'); 

WRITE  ('FILE  “',NAHE,"'.  THE  SPELLING  IN  ',NAHE,'  IS'); 
WRITELN  ('  ”', SPELL,'”  '); 

WRITELN  ('DO  YOU  WANT  TO  CHANGE  THE  '",NAHE,"'  SPELLING?') 
WRITELN  ('ENTER  "YES”  TO  CHANGE,  ELSE  ENTER  "NO"'); 

WRITE  ('ANSWER  =  '); 

READLN( ANSWER); 

IF  ANSWERO'YES'  THEN 
BEGIN  OIF*) 

ENDFLAGs-TRUE; 

END  OIF*) 

ELSE 

BEGIN.  OELSE*) 

WRITELN; 

WRITELN  ('ENTER  TK1  CORRECT  SPELLING  OF  WORD.'); 

WRITE  ('WORD  *  '); 

READLN  (SPELL); 

FOR  I:*  (LENGTH(SPELL) )  TO  (SPELLENGTH-1)  DO 
BEGIN  OFOR*) 

SP ELL :*C0NC AT (SPELL,'  '); 

END  OFOR*) ; 

END  OELSE*) ; 

IF  (ENDFLAG-TRUE)  OR  ( SPELL :WD AT A. SPELL  INC)  THEN 
BEGIN  OIF*) 

FLAG:sTRUE; 

END  (*IF») ; 

UNTIL  FLAG; 

END  (*IF*) ; 

ENDOWITH*) ; 

END(*IF») ; 

END  (*IF»); 


END  OPROCEDURE  1NIT1*); 


(xx#xxxxxxxxxxxxxxxxxxxxxxxx*xxxxxx«x*»*x*xxx***xxxxxxxxxxxxxxxxxxxxxx 


PROCEDURE  STOREWORD  DESCRIPTION 


THIS  PROCEDURE  DIVIDES  A  SPEECH  FILE,  REFERRED  TO  AS  QUT2, 
INTO  RECORDS  OF  THE  FORMAT  REQUIRED  BY  FILE  SPECH,  AND  STORES 
THESE  RECORDS  IN  THE  FILE  SPECIFIED  AS  SPECH.  NOTE  THAT  THIS 
PROCEDURE  ASSUMES  THAT  THE  WORD  SPELLING,  STORED  IN  THE  SPEECH 
FILE,  HAS  ALREADY  BEEN  READ. 


XXXXXKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKKXXXXXXXXXXXXXXXXXXXXXXXXXX} 


SEGMENT  PROCEDURE  STOREWORD; 
VAR 


TEMPIOUT:ISTRING; 

INT ,I,J:INTEGER; 

BEGIN  (xpROCEDURE  STOKEWORDx) 

WITH  IOUT  DO 
BEGINUUITHx) 

READLN(0UT2,SPKR) ; 

FOR  1 5  =  1  TO  8  DO 
BEGIN  (xFORx) 

READLN ( OUT2 )  ; 

END  (xFORx); 

IREC0RDi=0; 


(Insert  first  record  of  speech  file  at  end 
of  file  SPECH.} 

RESET(SPECH); 

WHILE  NOT  EOF(SF'ECH)  DO 
BEGIN(xWHILEx) 

SEEK (SPECH, I  RECORD) ; 

GET  (SPECH); 

TEMPIOUTi^SPECH*; 

IRECORD:-IRECORD+l; 

GET(SPECH); 

END  («WHILE«>; 

BEGINREC  t  =  IRECORD ; 

(Divide  speech  file  into  records,  and 
initialize  the  data  structures.} 

Js*0; 

WHILE  NOT  E0F(0UT2)  DO 
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BEGIN  (»UHILE«) 

J:=J+l; 

IF  J  =  <RCRDLENGTH+1 )  THEN 
BEGIN  <  * I F* ) 

CONTFLAG  s  =  1 ; 

IF  IRECORD-O  THEN 
BEGIN  <  *  IF* ) 

APPEND(SPECH) ; 

END  (*IF») 

ELSE 

BEGIN(*ELSE« ) 

SEEK ( SPECH, ( IRECGRD-1) ) ; 
GET(SPECH) ; 

END ( *ELSE*  > ; 

I RECORD :sIRECORD  +  l ; 

SPECH* :  =  IOUT ; 

PUT(SPECH); 

J  :sl ; 

END  (*IF# ) ; 

READ(0UT2,INT); 

FOR  I :s 1  TO  NOGUESSES  DO 
BEGINUFOR*) 

READ  (OUT2,ICHPHONECI  ,J3> ; 
READ  (0UT2,ICHVALUEL1,J]); 
END  («FOR»); 

READLN  (0UT2,ISFACT0R[JI) ; 
ISC0REN]:  =  0.0; 
lPHONEtJl*.=0*, 

IVERRORC JI ' XXXXX' ; 

END  UHHILE*); 

CONTFLAG  s  =  0 ; 

FOR  INTss(J+l)  TO  RCRDLENGTH  DO 
BEGIN  (»FOR# ) 

FOR  1 1  TO  NOGUESSES  DO 
BEGIN  (*FOR#> 

ICHPHONEtl ,  INTI s  =  0; 
ICHVALUECI,INT]s=1.0; 

END  (*FOR»); 

ISFACTORL INTI  5  =  0.0; 
I3C0RECINT3  s  =  0.0; 

IPHONEEINTD i=0; 

IVERRORC INT3 ;= ' XXXXX' ; 

END  («FQR»); 

IF  IREC0RD=0  THEN 
BEG IN(*IF*> 

APPEND (SPECH); 

END  (»IF») 

ELSE 

BECIN(*ELSE* ) 

SEEK (SPECH , ( I RECORD- 1 ) ) ; 
GET (SPECH); 

ENDUELSE*); 

IREC0RD:  =  IREC0RD  +  1 ; 
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SFECH A  s  =  I OUT  ? 
Pl)T(SPECH) ? 

END(*W1TH») ? 

CLOSE ( 0UT2  > ; 

ENDUPROCEDURE  STOREWORD*)? 


{**####»*#*#«**#*#****##*******»«**##**#*#*♦*#*#«##»*##***#********#«» 


PROCEDURE  DETPHREP  DESCRIPTION 


THIS  PROCEDURE  DETERMINES  THE  PHONEME  REPRESENTATION  FOR  A 
GIVEN  WORD  BASED  ON  THE  THE  WORD'S  TRAINING  SET  STORED  IN  FILE 
SPECH,  BASICALLY,  THIS  PROCEDURE  MAKES  AN  INITIAL  EDUCATED  GUESS 
OF  THE  PHONEME  REPRESENTATION,  THEN  ITERATIVELY  ALTERS  THIS  REPRESENT¬ 
ATION  BASED  ON  THE  STATISTICS  OBTAINED  FROM  ATTEMPTING  TO  RECOGNIZE 
THE  WORD  USING  THE  TRAINING  SET  SPEECH  FILES. 

THE  USER  MUST  ENTER  THE  WORD  TO  BE  OPERATED  ON,  THE  NUMBER  OF 
VECTORS  PER  WORD  PHONEME,  AND  THE  MAXIMUM  NUMBER  OF  ITERATIONS 
(NOITERATIONS)  TO  BE  PERFORMED.  REFER  TO  THE  DESCRIPTION  GIVEN  FOR 
PROCEDURE  AUTODET  FOR  A  MORE  DETAILED  DISCUSSION  OF  THE  ALGORITHM 
EMPLOYED. 


XXXXKXXKXXXXXXKXXXXXXXXXXXXXXXXKXXXXXXXXXKKXXXXXXXXXXXKXXKXXXXXXXXXXXKX} 


SEGMENT  PROCEDURE  DETPHREP? 

VAR 

NOITERATIONS,ICOUNT, I, NOWORD; INTEGER? 
TEMPWORD ,DETWORD : STRING! SPELLENGTHJ? 

BEGIN  ODETPHREPx) 

1N1TFILE? 

GETFUZZYVARS? 

RESET(WFILE) ? 

RESET(SFILE); 

WORDNO:sO? 
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N  DWORD : =0 ; 

SEEK (UF I LE , NOWORD) ; 

GET(UFILE) ; 

WRIT ELK; 

URITELN ( ' THE  FOLLOWING  IS  THE  CURRENT  VOCABULARY'); 

WRITELN; 

WHILE  NOT  EOF(WFILE)  DO 
BEGIN  ( #WHILE* ) 

WDATA:=WFILEA ; 

WRITELN(WDATA, SPELLING); 

GET(WFILE); 

END  (*UHILE«); 

WRITELN; 

WRITELN; 

WRITELNt ' INPUT  WORD  TO  DETERMINE  PHONEME  REPRESENTATION'); 
WRITELN; 

WRITE  ('WORD  *  '); 

READLN  (DETUORD); 

WRITELN  ; 

FOR  I;:<LENGTH(DETWORD>)  TO  (SPELLENGTH-1 )  DO 
BEGIN( *FOR*) 

DET WORD :• CONC AT (DET WORD , '  '); 

END(*FOR*) ; 

RESET  (WFILE) ; 

TEKPWORD:2 '  '; 

NOWORDi-O; 

WHILE  (NOT  EOF ( WFILE )) AND (DETWORD( >TEMPWORD)  DO 
BEGIN  (*WHILE») 

SEEK(WFILE, NOWORD); 

GET(WFILE); 

WDATA:-WFILE* ; 

GET(WFILE); 

TEHPWOKD:=WDATA. SPELLING; 

NOWORD:sNOWORD+  1 
END  (*UHILE«); 

IF  (DETWORDOTEMPWORD)  THEN 
BEGIN  <  *  I F* ) 

WRITELN; 

WR ITELN ( ' UORD  DOES  NOT  EXIST.'); 

ENDFLAGi=TRUE; 

END  (*IF*) 

ELSE 

BEGINUELSE*) 

ENDFLAGj  =  FALSE  ; 

NOWOKD:=NOWORD-1 ; 

WRITELN; 

WRITELN; 

WRITELN( *  ENTER  THE  NUMBER  OF  VECTORS  PER  WORD  PHONEME.'); 
WRITEC*  VECTORS  *  '); 

READ(NOPHVECTORS) ; 

WRITELN; 

WR I TELH ( ' ENTER  THE  NUMBER  OF  ITERATIONS  TO  BE  PERFORMED.') 
WRITEC NO.  ITERATIONS  *  '); 

READLN (NOITERAT I ONS > ; 


WR1TELN; 

FILETITLE (STATSOUT  ^'PHREPDAT'); 
REWRITE(STATSOUT) ; 

OPERATION s  = ' AUTODET '  ; 

AUTODETt NOWORD, NOPHVECTORS ,CHNG£FLAG) ? 

INITSTAT (NOWORD) ; 

IC0UNTs=0; 

REPEAT 

ICOUNT:  =  IC0UNT+1 ; 

CHNGEFLAG  s  =  FALSE ; 

OPERATIONS1 ' CORRECTS ' ? 

AUTODET (NO WORD ,NOPH VECTORS , CHNGEFLAG); 
INITSTAT (NOWORD) ; 

OPERATION s  = ' ADJUSTDE'  ; 

AUTODET (NOUORfi ,NOPHVECTGRS  f CHNGEFLAG ) ; 
INITSTAK  NOUORD ) ; 

UNTIL  ( NOI TER AT I ONS= I COUNT) OR (CHNGEFLAG1 FALSE) ; 
CLOSE(STATSOUT); 

END ( »ELSE* ) $ 


ENDOPROCEDURE  DETPHREP*  > ; 


{»#*##»####*#*##»**#*####**#*#*#*##*****#*#*#*###«#»###**#*»**»#*****#*# 


PROCEDURE  INITSTAT  DESCRIPTION 


THE  PURPOSE  OF  THIS  PROCEDURE  IS  TO  CALCULATE  A  WORD'S  STATISTICS 
USING  THE  WORD'S  TRAINING  SET  PROVIDED  IN  FILE  SPECH.  THIS  IS  DONE  BY 
OBTAINING  THE  ERRORS  THAT  OCCUR  IH  ATTEMPTING  TO  RECOGNIZE  EACH 
OF  THE  TRAINING  SET  FILES  (BY  EXECUTING  PROCEDURE  SCOREWORD),  AND 
USING  THIS  INFORMATION  TO  UPDATE  THE  STATISTICS  (BY  EXECUTING  PROCEDURE 
UPDATE). 


»««*«««* «*««*« *««**««*««•««»«**««*««*««««««««*«*} 


SEGMENT  PROCEDURE  1NITSTAT; 


J ,IHT  tIMTEGER? 

BEG1N(#INITSTAT») 

WRITELN; 

UR  I TELN ( ' I N I TST AT ‘  ) ; 

Unitialize  word  statistics.) 

WITH  WSTATS  DO 
BEGIN  UWITH#) 

NOSPOKE:=0; 

NOCORRECT:1!); 

NOATTEMPT:sO; 

T0TDIFF:*0.0; 

HINDIFFi=1.0; 

FOR  I  NT  2  =  1  TO  HAXPHONES  DO 
BEGIN  UFOR*) 

NDELIINT):=0; 

FOR  J:=  1  TO  NOPHONEMES  DO 
BEGIN  (*FOR«) 

XINAFTZIINT ;J3 5*0.0; 

XSUBYCINT ,J]  :s0.0 
END  UFOR*) 

END  UFOR*); 

T0TALSC0RE::0.0; 

SEEK ( SFILE , NOWORD); 

SF1LE*  s=WSTATS; 

PUT  (SFILE) 

END  UWITH*) $ 

SEEKIMFILE, NOWORD); 

GET(WFILE) ; 

WDATAj-UFILE*; 

URITELN ( ' WORD  =  ' ,WDATA . SPELLING) ; 
WORD:=WDATA. SPELLING; 

BEGINREC:-0; 

{Update  statistics  using  the  word's  training 
files  stored  in  file  SPECH.) 

RESET(SPECH) ; 

WHILE  NOT  EOF(SPECH)  DO 
BEGIN(*WHILE») 

SEEK (SPECH, BEG  I NREC) ; 

GET(SPECH) ; 

IOUTi=SPECH* ; 

IF  IOUT.SPELL*WORD  THEN 
BEGINUIF*) 

SCOREWORD; 

UPDATE; 
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SEEK ( SP ECH ,BEG1NREC) ; 
REPEAT 

GET ( SPECH ) ; 
IOUTssSPECHA ; 
BEGINREC:=BEGINREC+1; 
UNTIL  t IOUT »C0NTFLAG  =  0) ; 
GET(SPECH); 

END ( * IF» ) 

ELSE 

BEGIN ( *ELSE* ) 

BEGINREC:=BEGINREC+1; 

GET (SPECH); 

END(*ELSE»); 

ENDUUHILE*) ; 

END< ^PROCEDURE  INITSTAT#) ; 


{»**»**#*#*#**###***##***»**####********#####**###**#***########*#*##* 


FUNCTION  POWER  DESCRIPTION 


THIS  FUNCTION  IS  PASSED  TWO  VARIABLES, .ARG  AND  PUR,  WHICH 
IT  USES  TO  CALCULATE  THE  VALUE  OF  POWER,  WHICH  IS  EQUAL  TO 
“ARC”  RAISED  TO  THE  "PUR"  POWER. 


*#***##*#*##*#*###»##**#**###*##**#**#####**#*#*#**###***##**#**#*####> 


FUNCTION  POWER; 

BEGINOPOUER*) 

IF  ( (ARG>s0) AND (ARG< 0.0001 ) )  THEN 
P0UER::0 
ELSE 

POWER : : EXP (PWRk(LN( ARC ) ) ) ; 


END  UPOUER*) 


I*********************************************************************** 


PROCEDURE  INI TF HE  DESCRIPTION 


THIS  PROCEDURE  ACCEPTS  THE  THREE  INITIALS  PRECEEDING  THE 
FILES  TO  BE  OPERATED  UPON,  AND  CONCATINATES  THEN 
WITH  THE  APPROPRIATE  FILE  NAMES.  THE  FOLLOWING  FILES  ARE  AFFECTED: 


UFILE 

SFILE 

SPECH 

FZOPT 


*************************************************************#***********} 


PROCEDURE  INITFILE; 


BEGIN  (*INITFILE») 

URITELN; 

URITELN; 

WRITELN( 'ENTER  "THREE"  INITIALS  FOR  THE  CURRENT  ^PERMANENT#  '); 
WRITELN( 'SPEAKER;  OR  ENTER  "ALL"  FOR  *NON*PERMANENT*  SPEAKERS.'); 
WRITE ( ' SPEAKER  =  ‘  >; 

READLN(SPEAKER) ; 

URITELN; 

FI LENAHE:sCONC AT (SPEAKER ,' UFILE ') ; 

FILETITLEtUFILE , FILENAME); 

FILENAME :*CONCAT( SPEAKER,  'SFILE'); 

FILETITLE(SFILE, FILENAME) ; 

FILENAME : 5 CONCAT (SPEAKER  , ' SPECH' ) ; 

FILETITLE ( SPECH  # FILENAME); 

F I LENAME:-CONC AT (SPEAKER  >' FZOPT ') ; 

FILETITLE(FZOPT, FILENAME); 

ENDUPROCEDURE  INITFILE*); 


{******************************************************************** 


PROCEDURE  GETFUZZYVARS  DESCRIPTION 


THIS  PROCEDURE  READS  THE  VALUES  OF  THE  OVERALL  FUZZY  VARIABLES 
STORED  1H  THE  FILE  FUZZYVAR. 


WITH  WDATA  DO 
BEGIHUUITH*) 

STHRs=WSTHR; 

SUBE  :-USUBE; 

SUBF:=WSUBF ; 

I MSE  s  =  UIHSE; 

INSF:sWINSF ; 

DELE:=UDELE; 

DELFi=WDELF; 

DELG:=WDELG; 

DCNEisWDCNE; 

DCNF:-UDCHF ; 

DCNG:=WDCNG; 

SFE:*WSFE; 

SFF:=WSFF; 

CHVE:=WCHVE; 

CHVF:sWCHVF ; 

STATE  5  =  WST ATE ; 
STATF:=WSTATF ; 

STATG  s  =  MSTATG ; 
THR1E:=WTHR1E; 
THR1F:SWTHR1F: 
THR2E:-WTHR2E; 
THR2F:*WTHR2F; 

END(«WITH») ; 

EHD< ^PROCEDURE  GETWORDVARS*) ; 


BEGIN  UPROGRAH  LEARNWORDS*) 

STOPFLAG:sFALSE; 

REPEAT 

WRITELN; 

WRITELN; 

WRITELN< ' CHOOSE  OPTION  FROM  FOLLOWING  LIST'); 

WRITELN; 

WRITELNCl  -  COLLECT  STATISTICS  ON  A  SPEECH  FILE.'); 

WRITELN! '2  -  ISOLATED  WORD  RECOGNITION.'); 

WRITELN! ' 3  -  DETERMINE  PHONEME  REPRESENTATION  FOR  A  GIVEN  WORD.') 
WRITELN! '4  -  CHANGE  PHONEME  REPRESENTATION  OF  A  GIVEN  WORD.'); 

WR ITELN ( ' 5  -  CHANCE  THE  FUZZY  VARIABLES. ') ; 

WRITELNC 6  -  RUN  AH  AUTOMATIC  PROGRAM,’); 

WRITELN ( ' 9  -  STOP.'); 

WRITELN; 

WRITE! *  OPTION  *  ’); 

READLN(OPTION) ; 

WRITELN; 


IF  OPTION  *  1  THEN 
BEGIN ( « I F* )  " 

INITFILE; 

MINIT1; 

IF  ENDFLAGsFALSE  THEN 
BEGIN  UIF*) 

STORE WORD; 

GETFUZZYVARS; 

SCOREUORD; 

UPDATE; 

END ( *1 F* ) ; 

END(«IF«> 

ELSE  IF  OPTION  =  2  THEN 
BEGINUELSE*) 

ISORECOG; 

ENDUELSE*) 

ELSE  IF  OPTION  *  3  THEN 
BEGIN(ttELSEft) 

DETPHREP; 

ENDUELSE*) 

ELSE  IF  OPTION  *  4  THEN 
BEGLN(*ELSE«) 

CHNGEREP; 

ENDUELSE*) 

ELSE  IF  OPTION  *  5  THEN 
BEGIN(*ELSE«) 

CHNCEFUZ; 

ENDUELSE*) 

ELSE  IF  OPTION  *  6  THEN 
BEGINUELSE*) 

AUTOHAT; 

END ( »ELSE* ) 

ELSE  IF  OPTION  *  9  THEN 
BEGINUELSE*) 

STOPFLAGs=TRUE; 

END  UELSE*) 

ELSE 

BEGINUELSE*) 

URITELN; 

URI TELN ( 'OPTION  DOES  NOT  EXIST'); 
URITELN; 

ENDUELSE* ) ; 

CLOSE ( UF I LE ) ; 

CLOSE ( SF I LE ) ; 

CLOSE (SPECH); 

UNTIL  STOPFLAG; 


END  UPROGRAH  LEARN*), 
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{»«#*«« »#*****»#***»*«###****#«*#***#*#**»»#**#»#*****#*#**#*»## 


PROCEDURE  SCOREWORD  DESCRIPTION 


THIS  PROCEDURE  USES  THE  WORD  INFORMATION  STORED  IN  FILES 
UFILE  AND  SFILE,  TO  PRODUCE  A  SCORE  INDICATING  HOW  WELL 
THE  SPEECH  FILE  MATCHES  THE  GIVEN  WORD.  THE  SPEECH 
FILE  BEING  ANALYZED  IS  STORED  IN  EITHER  FILE  RECOG 
OR  A  TEMPORARY  FILE ( TEHPF ILE ) .  ALTHOUGH  THE  SPEECH  FILE  IS 
REFERRED  TO  AS  "SPECH",  THE  PROPER  FILE  NAME  IS  ASSIGNED, 

USING  THE  FILETITLE  COMMAND,  PRIOR  TO  CALLING 
THIS  PROCEDURE. 

THE  FOLLOWING  VARIABLES  MUST  BE  INITIALIZED  PRIOR  TO 
CALLING  THIS  PROCEDURE:  WORD,  BEGINREC,  BEGINUORD,  AND  NOTOAVG. 


PROCEDURE  VARIABLES 


VARIABLE  DESCRIPTION 

h 

W  -  NUMBER  INDICATING  THE  CURRENT  WORD  PHONEME. 

S  -  CURRENT  VECTOR  OF  SPEECH  BEING  EXAMINED. 

VECTOR  -  VECTOR  OF  SPEECH  BEING  EXAMINED  (NOT 

NECESSARILY  THE  CURRENT  VECTOR. 

SCORE, PHONE, 

VERROR,CHPHONE, 

CHVALUE ,SFACTOR  -  IDENTICAL  TO  THE  VARIABLES  DESCRIBED  IN  FILE  SPECH' S 
DESCRIPTION  (THE  VARIABLE  NAMES  ARE  PRECEDED  BY  AN  "I" 

IN  FILE  SPECH)  EXCEPT  THESE  ARRAYS  CONTAIN  TWICE  THE 
NUMBER  OF  SPEECH  VECTORS.  THESE  ARRAYS  ARE  NECESSARY 
TO  REDUCE  THE  COMPUTATION  TIME  REQUIRED  IN  EXAMINING 
VECTORS  OCURRING  NEAR  THE  END  OF  A  RECORD. 

OFFSET  -  USED  TO  KEEP  TRACK  OF  THE  VECTOR  NUMBER  IN  THE  ABOVE 

ARRAY'S.  THIS  IS  NECESSARY  SINCE  ONLY 
A  FINITE  NUMBER  OF  RECORDS  (OF  THE  TYPE  IOUT)  CAN  FIT  IN 
MEMORY  AT  ONE  TIME. 

PR, ERROR  -  VARIABLES  RETURNED  FROM  PROCEDURE  FINDMAX  INDICATING 
A  WORD  PHONEME'S  SCORE,  AND  MAXIMUM  LIKELY  ERROR  TO 
HAVE  OCCURRED,  RESPECTIVELY. 

PHSCR, TERROR  -  ARRAYS  USED  TO  DETERMINE  WHICH  WORD 

PHONEME  OCCURRED  FOR  A  GIVEN  SPEECH  VECTOR. 


*#*«*#***#####*###*#****##*#**«*»##} 


SEGMENT  PROCEDURE  SCOREWORD; 

{NEEDS:  WORD,BEGINREC,BEGINWORD,NOTOAVG,VAR-PWORD,SCGRECOUNT) 


TYPE 

STR5sSTRINGC5! ; 


U :  1 NTEGER  CHORD  PHONEME  POINTER); 

S : INTEGER  {STRING  PHONEME  POINTER); 

VECTORjINTECER; 

SCORE : ARRAY! 1 • .MAX VECTORS]  OF  REAL; 

PHONE: ARRAY  Cl « • MAX VECTORS!  OF  INTEGER; 

VERROR : ARRAY! 1  • .MAX VECTORS]  OF  STRING! 5 ] ; 

CHPHONE : ARRAY! 1 « «NOGUESSES,l.. MAX VECTORS!  OF  INTEGER; 
CHVALUE:ARRAYC1,. NOGUESSES,  1 . .MAXVECTORS!  OF  REAL; 
SFACTOR : ARRAY! 1 « .MAXVECTORS!  OF  REAL; 

I  ,J:INTEGER; 

FLAG : BOOLEAN ; 

N:INTEGER; 

PR ; REAL ; 

ERROR :STRING!5! ; 

PHSCR: ARRAY !1 • ♦ N AXPHONES ]  OF  REAL; 

TERROR:ARRAYCl. .MAXPHONES]  OF  STRINGC5!; 
PW,PW1,PW2:REAL; 

OFFSET:INTEGER; 

TEMPUORD:STRING!SPELLENGTH! ; 

NOAVG:INTEGER; 


PROCEDURE  UGE2SGE3;  FORWARD; 

PROCEDURE  UEQ1SLE2 ;  FORWARD; 

PROCEDURE  WEQ1SGE3;  FORWARD; 

PROCEDURE  WGE2SLE2;  FORWARD; 

PROCEDURE  FINDHAX<N:INTEGER;  VECTOR ; I NTEGER ; 

VAR  PR:REAL;  VAR  ERR0R:STR5) ;  FORWARD; 
PROCEDURE  PWISMAX;  FORWARD; 

PROCEDURE  PW1ISMAX;  FORWARD; 

PROCEDURE  PW2ISMAX;  FORWARD; 

PROCEDURE  WDELETED(N:INTEGER);  FORWARD; 

PROCEDURE  CETRECORD;  FORWARD; 


{«*#*#*»**##*#«##**##********#*#«*##**#***»##****#*#*##«#*«#**#**« 


PROCEDURE  GET2RECORDS  DESCRIPTION 


THIS  PROCEDURE  GETS  THE  FIRST  TWO  RECORDS  OF  THE  SPEECH  FILE 
(IF  THE  FILE  IS  HORE  THAN  ONE  RECORD  LONG) ,  AND  STORES  THIS 
INFORFIATION  IN  THE  APPROPRIATE  ARRAYS. 


«*###»**#*######*###*#***###**#**#***«#*#*#*#*##***»*#*»#**##*#**##*##** 


PROCEDURE  GET2REC0RDS; 
VAR 

I ;J:INTEGER; 

BEGIN UGET2REC0RDS*) 


(Initialize  record  arrays  to  zero.) 

FOR  J:*l  TO  HAXVECTORS  DO 
BEGIN  («FOR«) 

FOR  Is*  1  TO  NOGUESSES  DO 
BEGIN  («FOR») 

CHPHONEtI,J]s*0; 

CHVALUECI ,J]:=0.0; 

END  UFOR*); 

SFACTORC JD :=0 . 0 ; 

VERRORt J] ;* ' XXXXX* ; 

END  <*FOR»); 

RESET(SPECH)5 

SEEK  <SPECH,IRECORD>? 

GET  (SPECH); 

10UT:*SPECH‘; 

(Set  array  values  to  corresponding  values  given 
in  the  speech  file  being  exasined.) 

WITH  IOUT  DO 
BEGIN  <«UITH») 

FOR  J  %  *  1  TO  RCRDLENGTH  DO 
BEGIN  (»FOR«> 

FOR  I:*l  TO  NOGUESSES  DO 
BEGIN  (*FOR») 

CHPHONECI,J]s*ICHPHONECI,J]? 

CHVALUECI, J3s«ICHVALUE(I,J3? 

END  («FOR«); 


SFACT0RCJ3 :sISFACT0R[J3 ; 
SCOREC J3 :  =  ISCOREC J3 ; 
PHONEtJ3ssIPHQNECJ3 ; 
VERROREJ3 :  =  IVERRORCJ3 ; 

IF  CHPHOHEt 1 ,J3  <>  0  THEN 
BEGIN  (»IF*) 

ENDWORD:  = J ; 

END  < *IF* ) ; 

END  (*FOR*); 

END  (#WITH») ; 


{If  speech  file  is  longer  than  one  record,  initialize 
the  second  half  of  the  array.} 

IF  IOUT.CONTFLAG  =  1  THEN 
BEGIN  ( *  I F* ) 

GET  (SPECH) ; 

IOUT:=SPECHA; 

WITH  IOUT  DO 
BEGIN  ( #WITH* ) 

FOR  J  s 5 1  TO  KCRDLENCTH  DO 
BEGIN  < «FOR«) 

FOR  I:?  1  TO  NOGUESSES  DO 
BEGIN  (*FOR*) 

CHPHONECI ,J*RCRDLENGTH3:=ICHPHONEII  ,J3; 
CHVALUEtl ,J+RCRDLENGTH3 :sICHVALUEtI  ,J3  $ 
END  ( *FOR# ) ; 

SFACTQRLJ+RCRDLENGTH3  :  =  ISFACTORt J3 ; 

SC0RECJ  +  RCRDLENGTH3 :  =  ISC0RECJ3 ; 

PHONEt J+RCRDLENGTH3 :: IPHQNECJ3 ; 

VERRORC J  +  RCRDLENGTH3  s  =  IVERRORCJ3 ; 

IF  ICHPHONEI 1 ,  J3  (>  0  THEN 
BEGIN  (*IF») 

ENDWORD:=J+RCRDLENGTH; 

END  (*IF»); 

END  (*F0R») 

END  (*WITH* ) ; 

END  (»IF»); 

END  ^PROCEDURE  GET2REC0RDS* ) ? 


{«*#****###»#»**#**#**.*%#»#**#*#*#*#**#******#####**»*##**##«*##*»«*«*« 
PROCEDURE  WGE2SGE3  DESCRIPTION 


THIS  PROCEDURE  DERIVES  ITS  NAME  FROM  THE  FACT  THAT  IS  ONLY  CALLED 
WHEN  WORDLENGTH  -  W  >*  2  AND  ENDUORD-S  >-3.  IN  OTHER  WORDS,  IT 
IS  ONLY  CALLED  WHEN  MORE  THAN  2  PHONEMES  REMAIN  IN  THE 
WORD  PHONEME  REPRESENTATION,  AND  WHEN  THERE  ARE  MORE  THAN  3  VECTORS 
REMAINING  IN  THE  SPEECH  FILE. 

THIS  PROCEDURE  DETERMINES  WHICH  WORD  PHONEME  MOST  LIKELY 


OCCURRED  FOR  THE  CURRENT  VECTOR  OF  SPEECH  <S).  THIS  IS 
BASICALLY  ACCOMPLISHED  BY  EXAMINING  THE  PLAUSIBILITY  OF  THE  CURRENT 
WORD  PHONEME  (y)  OCCURRING,  AS  COMPARED  WITH  THE  PLAUSIBILITIES 
OF  THE  NEXT  TWO  WORD  PHONEMES  (W  +  l  AND  W+2)  OCCURRING  FOR  THE  CURRENT 
VECTOR  OF  SPEECH.  IF  THE  CURRENT  WORD  PHONEME  PLAUSIBILITY 
IS  LESS  THAN  EITHER  OF  THE  NEXT  TWO  PHONEMES,  THEN  THE  NEXT 
"NOTOAVG"  VECTORS  <3  VECTORS  IF  NOTOAVG)  THE  NUMBER  OF  VECTORS  LEFT) 
HILL  BE  EXAMINED  TO  DETERMINE  IF  THE  CURRENT  VECTOR  IS  AN  ACTUAL 
TRANSITION  BETWEEN  WORD  PHONEMES,  OR  IS  AN  INSERTION  ERROR. 


******** ***************************************************************} 


PROCEDURE  WGE2SGE3; 


VECTOR, NtINTEGER; 

BEGIN  OUGE2SGE3*) 

FOR  N:=U  TO  (U+2)  DO 
BEGIN  UFOR*) 

VECTOR:=S; 

FINDMAX  (N, VECTOR, PR, ERROR)? 

PHSCRCNIjsPR; 

TERRORIN3 :=ERROR; 

END  UFOR*); 

IF  (PHSCRCWDPHSCRCW+II)  AND(  PHSCRCWI )  =  PHSCR[W+23  )  THEN 
BEGIN  UIF*) 

PWISMAX; 

END  (*IF*) 

ELSE  IF  (PHSCRCW]<=PHSCRCU+13)AND(PHSCR[W+n>-PHSCRtW+2D)  THEN 
BEGIN  UELSE*) 

PH :=PHSCR[W] ; 

PW1 : =PHSCRtW+l ] ; 

IF  (ENDUORD-S- NOTOAVG )  )  =  0  THEN 

NOAVG:=NOTOAVG 
ELSE 

N0AVG::3; 

FOR  VECTOR s  =  < S  + 1 )  TO  (S  +  NOAVG)  DO 
BEGIN  UFOR*) 

N:sW; 

FINDMAX(N, VECTOR, PR, ERROR); 

PM:sPW+PR; 

N :  =  W+ 1 ; 

FINDHAX(N, VECTOR, PR, ERROR); 

PW1  :*PW1+PR; 

END  UFOR*); 

PW:^PW/(NOAVG+l) ; 

PK1 : sPWl/ (NGAVG  +  1 ) ; 

IF  ( ( THR 1  EMPOWER (PW , THR 1 F ) ) >  =  P W 1 )  THEN 
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BEGIN  <»IF») 

P W I SH AX  5 
END  <*IF«) 

ELSE 

BEGIN  (*ELSE») 

PW1ISHAX; 

END  UELSE*); 

END  t *ELSE* ) 

ELSE  IF  (PHSCR[«+2]>PHSCR[U])AND(PHSCRCU+2]>PHSCRt«+l])  THEN 
BEGIN  (*ELSE«) 

PW:  =  PHSCRtH] ; 

PNl:sPHSCRCU+l]; 

PW2s  =  PHSCRCW+23 ; 

IF  (ENDWORD-S-NOTGAVG)  >=  0  THEN 
NOAVG  :sNOTOAVG 
ELSE 

NQAVG:=3; 

FOR  VECTOR s  =  ( S+l )  TO  (S+NQAVG)  DO 
BEGIN  (*FOR») 

N:*U; 

FINDHAX(N, VECTOR, PR, ERROR); 

PHs-PW+PR; 

N:-U+l ; 

FINDHAX  (N, VECTOR, PR, ERROR); 

PW1 a=PWl+PR; 

N:sU+2; 

FINDNAXIN, VECTOR  H, ERROR); 

PH2:sPU2+PR; 

END  (*FOR») ; 

PH:=PW/(NOAVG+l); 

PW1 :  =  PUl/(NOAVG+l ) ; 

PU2:  =  PU2/(N0AVG+1) ; 

IF  ( (THR1 EMPOWER (PU ,THR1F ) ) )=PH1 )AND 

((THR2E»P0WER(py,THR2F)))-PW2)  THEN 
BEGIN  ( *  I F* ) 

PUISHAX; 

END  ( #IF* ) 

ELSE  IF  (PW1 > <THR1E<POWER<PW ,THR1F) ) )  AND 

( (THR2E*P0WER(PWl ,THR2F ) ) >  =  PW2)  THEN 
BEGIN  ( *ELSE* ) 

PWIISHAX; 

END  UELSE*) 

ELSE 

BEGIN  (*ELSE*) 

PU2ISNAX; 

END  <«ELSE»); 

END  UELSE*); 


END  < »PROCEDURE  UGE2SGE3») ; 


{*#****#»*#*****#*###**#**«**#*#**#**#*##**#*#*##**#*«***#*******#****** 


PROCEDURE  WEQ1SLE2  DESCRIPTION 


THIS  PROCEDURE  IS  CALLED  WHEN  THERE  ARE  ONLY  TWO  WORD  PHONEMES 
REMAINING  AND  WHEN  THERE  ARE  LESS  THAN  3  VECTORS  OF  SPEECH  REMAINING, 
UNLIKE  THE  PREVIOUS  PROCEDURE,  THE  DECISION  OF  WHETHER  OR  NOT  A 
TRANSITION  OCCURRED  IS  BASED  SOLEY  ON  THE  PLAUSIBILITY  OF  THE 
REMAINING  PHONEMES  OCCURRING  FOR  THE  CURRENT  VECTOR.  IF  A  TRANSITION 
OCCURS,  THEN  THE  PLAUSIBILITIES  OF  THE  LAST  PHONEME  OCCURRING 
FOR  THE  REMAINING  VECTORS  OF  SPEECH  ARE  COMPUTED. 


*«*«««*« *««*« «*****«*« a#*#*##*###*###*****##*###**####*#**#*##*#*##***#* 


PROCEDURE  UEQ1SLE2; 

VAR 

NsINTEGER; 

BEGIN  UUEQ1SLE2*) 

FOR  N:sW  TO  (W+l)  DO 
BEGIN  ( *FOR# ) 

VECTOR:=S; 

FINDMAX  (N, VECTOR, PR, ERROR); 
TERRORCN]:=ERROR; 

PHSCRCN3:=PR; 

END  (#FOR*) ; 

IF  PHSCRCW] >PHSCR[W+13  THEN 
BEGIN  (#IF«) 

PUISMAX; 

IF  (ENDWORD-SXO  THEN 
BEGIN  (*IF») 

N:  =  W+1 ; 

WDELETED(N) ; 

END  <*IF*> 

END  («IF«) 

ELSE  IF  PHSCRtW+lIXPHSCRCW]  THEN 
BEGIN  (*ELSE») 

PW1ISMAX; 

WHILE  (ENDWORD-S) >*0  DO 
BEGIN  ( * WH I LE* ) 

VECTOR s-S; 

FINDMAX(W, VECTOR, PR,  ERROR); 
PHSCRCU3:sPR; 
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TERRORCWI  TERROR; 
PWISMAX; 

END  ( #WHILE* ) ; 

END  t«ELSE«) ? 


END  <*WEQ1SLE2»); 


{it##*##**#**#******##***##*#*##**#*##****#********#*#**#*#**##*##*####* 


PROCEDURE  WEQ1SGE3  DESCRIPTION 


THIS  PROCEDURE  IS  ONLY  CALLED  WHEN  THERE  ARE  TWO  WORD  PHONEMES 
LEFT  AND  WHEN  THERE  ARE  NORE  THAN  2  VECTORS  OF  SPEECH  REGAINING. 

THIS  PROCEDURE  IS  SIMILAR  TO  PROCEDURE  WGE2SGE3,  EXCEPT  THAT  WHEN  A 
A  TRANSITION  TO  THE  LAST  WORD  PHONENE  OCCURS,  THE  PLAUSIBILITIES  FOR 
THE  LAST  WORD  PHONEME  OCCURRING  FOR  THE  REMAINING  VECTORS  OF  SPEECH  IS 
COMPUTED. 


a#***#*####***##**#*#* a#*###*##*#**#*#*#*###*##*#*#**#**#**#*###*#**###*} 


PROCEDURE  WEQ1SGE3; 

VAR 

VECTOR, NsINTEGER; 

BEGIN  UWEQ1SGE3*) 

FOR  N :  =  U  TO  (W+l)  DO 
BEGIN  (*FQR») 

VECTOR:*S; 

FINDMAX  (N, VECTOR, PR, ERROR); 
PHSCRCNIs-PR; 

TERRORCN]s=ERROR; 

END  (»FOR») ; 

IF  PHSCRm>:PHSCRCW+l]  THEN 
PWISMAX 

ELSE  IF  <PHSCR[W]<PHSCRCW+1]>  THEN 
BEGIN  ( *ELSE») 

PW:sPHSCRCWI ; 

PWlssPHSCRtW+13; 
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IF  (ENDUORD-S-NOTOAVG)  >=  0  THEN 
NOAVG : sNOTOAVG 
ELSE 

N0AVGi=3* 

FOR  VECTOR i  =  ( S+l )  TO  (S+NOAVG)  DO 
BEGIN  (#FOR«) 

N ;  =  W ; 

FINDNAX  <N, VECTOR, PR, ERROR); 
F'H:-PW+PR; 

N:-U+l ; 

FINDNAX  (N, VECTOR, PR, ERROR)? 
PMlt>PUWPR; 

END  (*FOR»); 

PW:=PU/(NOAVG+l) ; 

PHl:sPWl/(NOAVG+l)  \ 

IF  (THR1E«P0WER(PW,THR1F))>PW1  THEN 
BEGIN(«IF«) 

PUISHAX; 

END  C  »IF* ) 

ELSE 

BEGIN  (*ELSE«) 

PW1ISHAX* 

WHILE  ( ENDUORD-S ) ) =0  DO 
BEGIN  ( *WHILE* ) 

VECTOR:=S* 

FINDNAX (w)vECTOR , PR, ERROR)  \ 
PHSCRC W] ;  =  PR ; 
TERRORCUIi-.ERROR; 

PWISNAX ; 

END  (#WHILE«)  \ 

FLAG: =FALSE; 

END  (»ELSE*>  \ 

END  («ELSE*)? 


END  ^PROCEDURE  WEQ1SGE3* ) ; 


{#*##*##»##*#*«###*#»#»#####*#*#####*###*##*»*****«*»**»####**#*#**##* 


PROCEDURE  UGE2SLE2  DESCRIPTION 


THIS  PROCEDURE  IS  CALLED  WHEN  NORE  THAN  2  WORD  PHONENES 
REHAIN  AND  LESS  THAN  3  VECTORS  OF  SPEECH  REMAIN .  THIS  PROCEDURE 
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ONLY  USES  INFORMATION  CONCERNING  THE  CURRENT  VECTOR  TO  DETERMINE 
WHETHER  OR  NOT  A  TRANSITION  OCCURS. 


XKXKXXftOffffXXXftJtXXXftftftXftKftXttXXXSt******  ««««**«  XXttttffttftttttXXXXKftKKKXftXftftftKX 


PROCEDURE  WGE2SLE2; 


N ; INTEGER ; 

BEGIN  («UGE2SLE2«) 

FOR  N:  =  W  TO  (W+2)  DO 
BEGIN  (*FOR») 

VECTOR  t  =  S ; 

FINDHAXIN, VECTOR, PR, ERROR); 

PHSCRIN3 :=PR; 

TERRORCN3:=ERROR; 

END  (»FOR») ; 

IF  ( ( PHSCR [ W1 >PHSCR[U+1 ] ) AND ( PHSCRC W) >PHSCRCW+23 ) )  THEN 
PWISHAX 

ELSE  IF  ( (PHSCRCV+l]>  =  PHSCR[U])AND(PHSCRCW  +  l]>  =  PHSCR[W  +  2]) )  THEN 
PW1ISHAX 

ELSE  IF  ( (PHSCRCH+2]  )  =  PHSCRC  W] )  AND  (PHSCRC  W+23  >  PHSCRC  U-x  1 3 ) )  THEN 
PW2ISMAX; 

IF  (ENDUORD-S) <0  THEN 
BEGIN  ( * I F * > 

WHILE  (WORDLENGTH-W) >0  DO 
BEGIN  («WHILE») 

NssW; 

WDELETED(N) ; 

W:SW+1; 

END  (*WHILE»); 

END  (*IF*); 


END  UPROCEDURE  WGE2SLE2*); 


{in***#*#**##*******#*####*#**#*****#***#*##*#**#***#***##*#****#****** 


*r> 


PROCEDURE  COMPUTESCORE  DESCRIPTION 


THIS  PROCEDURE  CONFUTES  THE  TOTAL  WORD  SCORE  BASED  ON  THE 
VALUES  OF  THE  FOLLOWING  VARIABLES:  DELCNT ,  WOKDLENGTH ,  DELW , 

PWORD t  AND  SCORECOUNT,  REFER  TO  THE  GLOBAL  VARIABLE  DESCRIPTIONS t 
PROVIDED  AT  THE  BEGINNING  OF  THE  H  A I N  PROGRAM,  FOR  AN  EXPLANATION 
OF  EACH  OF  THESE  VARIABLES. 


««*»»*»«K»»*K**«*«*«**«*«  If***#*######***#**#*#**##***#*###**** **»«###**> 


PROCEDURE  COMPUTESCORE; 
VAR 

ARGUMENT:REAL; 

BEGIN  UCOMPUTESCORE*) 


(Compute  the  total  deletion  score.) 

IF  DELCNTOO  THEN 
BEGINUIF*) 

ARGUMENT  s  =  l-C((l-(DELCNT/UORDLENGTH))*DCNG)  +  <l-DCNG ) * <  DELW/DELCNT ) ) ? 
ARGUMENT:* ARGUMENT/DCNE; 

WORDSCORE: =1 -POWERt ARGUMENT, DCNF); 

END ( * I F* ) 

ELSE 

BEGINUELSE*) 

WORDSCORE:*!; 

ENDUELSE#) ; 

{Coipute  the  total  word  score.) 

WORDSCORE :*( DELG*WORDSCORE)+ ( 1-DELG)* (PWORD/ SCORECOUNT) ; 

END(«PROCEDURE  COMPUTESCORE#); 
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{a***#**###*##*#*****##**###*#*#**##***#**##**********#*********#******** 


PROCEDURE  FINDHAX  DESCRIPTION 


THIS  PROCEDURE  DETERMINES  THE  PLAUSIBILITY  OF  A  GIVEN  WORD  PHONEME 
OCCURRING  FOR  A  GIVEN  VECTOR  OF  SPEECH,  THE  PLAUSIBILITY  IS 
CALCULATED  BASED  ON  WHETHER  A  SUBSTITUTION  ERROR  OR  AN  INSERTION 
ERROR  OCCURS.  A  SUBSTITION  ERROR  IS  SAID  TO  HAVE  OCCURRED  IF  THE  PHONEME 
BEING  EXAMINED  IS  AMONG  THE  NUMBER  OF  GUESSES  PROVIDED,  IF  NOT, 

AN  INSERTION  ERROR  IS  SAID  TO  HAVE  OCCURRED.  ONCE  THE  TYPE  OF  ERROR  IS 
DECIDED,  THEN  A  SCORE  IS  CALCULATED  FOR  EACH  OF  THE  GUESSES,  BASED 
ON  THE  VALUE  ASSIGNED  TO  EACH  GUESS  BY  THE  ACOUSTIC  PROCCESSOR,  AND  ON  THE 
STATISTICS  COLLECTED  FROM  THE  TRAINING  SET.  THE  MAXIMUM  OF  THESE  SCORES  THEN 
BECOMES  THE  PLAUSIBILITY  FOR  THE  WORD  PHONEME  BEING  EXAMINED. 


»*«»»»«••**•«•«** as *••**« #####*####*######»# a*#####*#*#***##*###*####*##*##*} 


PROCEDURE  FINDMAX; 

VAR 

VSCORE, SUBSCORE, AI NSSCORE : REAL ; 
MAXSCORE: ARRAY  Cl, • NOGUESSES 3  OF  REAL; 
HAX:REAL; 

IMAXsINTEGER; 

K  ? INTEGER ; 

I  ,J:INTEGER; 

ARGUMENT :REAL; 


BEGIN  («FINDHAX«) 

WITH  USTATS  DO 
BEGIN  (*UITH#) 

K;*VECTOR-OFFSET ; 

(Determine  if  an  insertion  or  substitution  error 
occurred  for  the  word  phoneme  for  the  given  vector.) 

ERROR!*' INSRT’ ; 

FOR  I:*l  TO  NOGUESSES  DO 
BEGIN  (#FOR«) 

IF  (WDATA.PHREPCN]*CHPHONECI,K]>  THEN 
ERROR:*'SUBST' ; 

IF  (XSUBYCN,CHPHONECI,K]]))<XSUBYCN,WDATA,PHREP[N)]#STHR)  THEN 
BECINU1F*) 

IF  <XSUBYCN,CHPHONEC I , K)] >0)  AND  <N< (MAXPHONES) )  THEN 


BEGIN  <  *1F*  > 

IF  <CHPHONEtI,K3<>«DATA.PHREPCN+13>THEN 
BEGIN ( «IF* ) 

ERRORs-'SUBST' ; 

ENDUIF«); 

END ( *  1 F* ) ; 

IF  N*MAXPHQNES  THEN 
BECIN(*1F*> 

ERROR:  =  ‘ SUBST' ; 

END(*1F») ; 

END(»IF«) ; 

END  <*FOR*); 

IF  ERROR1 ‘ SUBST'  THEN 
BEGIN  (*IF*) 

(Conpute  the  vector  score  given  that  a  substition 
error  occurred.) 

FOR  I:=l  TO  NOGUESSES  DO 
BEGIN  UFOR*) 

ARGUMENT  s  =  SFACTORCK3/SFE ; 

VSCORE:=( 1-CHVALUECI ,KJ/100)/CHVE; 

VSCORE:=<  1-POWER  ( VS'CORE ,  CHVF  ))*(  1 -POWER  ( ARGUMENT,  SFF)); 

IF  XSUBYIN /CHPHONEtl >X33( >0  THEN 
BEGIN(*IF*) 

ARGUMENT:  =  ( 1- (XSUBYCN  ,CHPHONECI ,K33/(NOSPOKE-NDELCN3) ) )/ STATE 
SUBSC0RE:*1-P0WER(ARGUMENT,STATF); 

END  t  *  I F* ) 

ELSE 

BEGIN(*ELSE*) 

SUBSC0RE:*O; 

END ( »ELSE» ) ; 

ARGUMENT  :  =  ( ( < l-STATG)#VSCORE)+( (STATG)»( SUBSCORE) ) )/SUBE; 
HAXSC0REU3  :  =  POWER<  ARGUMENT,  SUBF); 

END  (»FQR») ; 

END  (»IF«) 

ELSE  IF  ERROR* ' INSRT'  THEN 
BEGIN  ( »ELSE») 

(Compute  the  vector  score  given  that  an  insertion 
error  occurred.) 

FOR  I  :*1  TO  NOGUESSES  DO 
BEGIN  ( *FOR«) 

ARGUMENT :*SF ACTOR CK3/SFE; 

VSCORE:* ( 1-CHVALUEC I >13/100) /CHVE; 

VSC0RE:=(1-PQUER<VSC0RE,CHVF))*<1-P0UER< ARGUMENT,  SFF)); 

IF  XINAFTZCN,CHPH0NECI,X33<>0  THEN 
BEGIN(*IF*> 

ARGUMENT  s  =  { 1 - ( X I N A FT Z C N , CHPHONE C I , K 3  3 /NOSPOKE ) ) /STATE; 
AINSSC0RE:*1 -POWER (ARGUMENT  >STATF) ; 

END  <  * I F* ) 

ELSE 

BEGIN(»ELSE») 


AINSSCORE:sO; 

END ( *ELSE* ) ; 

ARGUMENT  s  =  < ( ( 1 - ST ATC >  «VSCORE )  +  <<STATC)*AINSSCORE))/INSE; 
HAXSCQRE[I];sPOUER( ARGUMENT ,INSF) ; 

END  <*FOR«) ; 

END  UELSE*); 

{Coapute  the  overall  vector  score  which  is  equal  to 

the  average  vector  score  for  all  the  choices  given  for  the 

vector.} 

PR:=0; 

FOR  1 5=1  TO  NOGUESSES  DO 
BEGIN  (»F0R») 

PR:  =  PR+MAXSC0REC1D ; 

END  <*FOR*); 

PR:*PR/NOGUESSES; 

END  t  *y ITH* ) ; 

END  (^PROCEDURE  FINDHAX* ) ; 


I##*#*###***###*#**##***####*#*##**#*#****###**##**###*#****##*##*#**# 
PROCEDURE  PUISNAX  DESCRIPTION 


THIS  PROCEDURE  IS  CALLED  WHEN  IT  IS  DECIDED  THAT  THE  CURRENT 
WORD  PHONEME  OCCURRED  FOR  THE  CURRENT  SPEECH  VECTOR.  THE  PURPOSE 
OF  THIS  PROCEDURE  IS  TO  STORE  THE  SCORE,  PHONEME  CHOSEN,  AND  THE  ERROR 
THAT  OCCURRED  FOR  THE  GIVEN  VECTOR;  AND  TO  INCREMENT  THE 
COUNT  THAT  INDICATES  THE  CURRENT  VECTOR  (S).  IF  THE  VECTOR  NUMBER 
IS  ABOVE  A  CERTAIN  VALUE,  THE  NEXT  SPEECH  RECORD  IS  RETRIEVED  AND 
PLACED  IN  MEMORY, 


*####»»**»*###»***«#**#»#*###**##»##**#»*#»##****#*»*«#****##***#*»*#**} 


PROCEDURE  PUISNAX; 
VAR 

K  s INTEGER ; 
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BEGIN  (ttPUISHAX*) 


KjsS-OFFSET? 

SCOREIKI  :  =  PHSCRIH] ; 

PHONEIK] :=WDATA.PHREPCWI ; 

VERRDRCK] :-TERROR[U] ; 

SCORECOUNT  JsSCORECOUNT+l ; 

PttORD :  =  PWORD+PHSCRC W3 ; 

IF  VERRORCKI1 ' INSRT '  THEN 
BEG 1H(*IF«) 

INSRTSCORE:sINSRTSCORE+SCOR£CK] ; 

END ( * I F  *  > ; 

Ss=S+l ; 

IF  ( (S-QFFSET) >=70)AND< I0UT«C0N7FLAG= 1 )  THEN 
BEGIN  (*IF«) 

GETRECORD; 

END  (*IF«); 

END  ^PROCEDURE  PWISMAX* ) ; 


PROCEDURE  PW1ISMAX  DESCRIPTION 


THIS  PROCEDURE  IS  CALLED  WHEN  A  TRANSITION  OCCURS  BETWEEN  THE 
CURRENT  ORD  PHONEHE  AND  THE  NEXT  ONE  ,  THE  PURPOSE  OF  THIS  PROCEDURE 
IS  TO  STORE  THE  SCORE,  PHONEHE  CHOSEN,  AND  THE  ERROR  THAT 
OCCURRED  FOR  THE  GIVEN  VECTOR;  AND  TO  INCREMENT  THE  COUNT 
THAT  INDICATES  THE  CURRENT  VECTOR  <S>.  IF  THE  VECTOR  NUMBER 
IS  ABOVE  A  CERTAIN  VALUE,  THE  NEXT  SPEECH  RECORD  IS  RETRIEVED  AND 
PLACED  IN  MEMORY.  A  CHECK  IS  ALSO  MADE  TO  DETERMINE  IF  A 
SUBSTITUTION  ERROR  OCCURED  FOR  THE  CURRENT  PHONEME,  IF  NOT, 

A  DELETION  ERROR  IS  SAID  TO  HAVE  OCCURRED,  AND  A  DELETION  SCORE  IS 
COMPUTED. 


imHHIlHHHmiHHHHttlilOHHtlflHIHHDIHHHimiHIHItHOI} 


PROCEDURE  PW1ISMAX; 
VAR 


X / J  x INTEGER ; 


BEGIN  UPW1ISMAX*) 


fT 


X:sS-0FFSET ; 

JssS-OFFSET-l; 

IF  J  =  0  THEN 
BEGIN ( « I F* ) 

J:*l; 

PH0NEC1D  s  =  0; 

VERR0R!13:s'XXXXX' ; 

END(*IF»); 

WHILE  ( VERRORL J3  = '  INBRT* )  AND  ( JMBEG1NW0RD+0FFSET) )  DO 
BEGIN  (*WHIL£*) 

J  s  =  J- 1 ; 

END  UWHILE*); 

IF  PHONE! JI<  >WDATA .PHREPCU3  THEN 
BEGINULFO 

UDELETED  (W); 

END  ( « IF* ) ; 

Ws*W+l; 

PWORD:-PWORD+PHSCRCW3 ; 

SCORECOUNT : =SC0REC0UNT+1 5 
SCOREIK3 ::PHSCR!W3 ; 

PHONECK3 : =WDATA.PHREPCU3 ; 

VERRORI XI :: TERROR CU1; 

IF  MERRORCKD= ' INERT '  THEN 
BEGINUIF*) 

INSRTSCORE:*INSRTSCORE+SCQRECX3; 

END(*IF«); 


S :  =  S+1 ; 

IF  C  tS-OFFSET>  >=70> ANIMIOUT  «C0NtFLAG= 1 )  THEN 
BEGIN  <*IF*> 

CETRECORD; 

END  ( *IF* ) ; 

END  (^PROCEDURE  PW1ISHAX»); 


{#*###*##«##*####*##**###*#«»##*##***»#*#**#***####*##»*****#*#«####*# 
PROCEDURE  PW2ISHAX  DESCRIPTION 


THIS  PROCEDURE  IS  CALLED  WHEN  A  TRANSITION  OCCURS  BETWEEN  THE 
CURRENT  WORD  PHONEME  AND  THE  PHONEME  FOLLOWING  THE  NEXT  ONE. 

THE  PURPOSE  OF  THIS  PROCEDURE  IS  TO  STORE  THE  SCORE f  WORD  PHONEME 
CHOSEN t  AND  THE  ERROR  THAT  OCCURRED  FOR  THE  GIVEN  VECTOR;  AND  TO 
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INCREMENT  THE  COUNT  THAT  INDICATES  THE  CURRENT  VECTOR  (S).  IF  THE 
VECTOR  NUMBER  IS  ABOVE  A  CERTAIN  VALUE,  THE  NEXT  SPEECH  RECORD  IS 
RETRIEVED  AND  PLACED  IN  MEMORY.  A  CHECK  IS  ALSO  MADE  TO  DETERMINE 
IF  A  SUBSTITUTION  ERROR  OCCURED  FOR  THE  CURRENT  PHONEME,  IF  NOT, 

A  DELETION  ERROR  IS  SAID  TO  HAVE  OCCURRED,  AND  A  DELETION  SCORE  IS 
COMPUTED.  A  DELETION  SCORE  IS  ALSO  COMPUTED  FOR  THE  WORD  PHONEME 
THAT  WAS  SKIPPED. 


XXXXXXXXXXXKXXXXXXXXXXXXXXXXXXXXX ******** XXXXXXXXXXXXXXXXXXXXXXX KXXXXXX} 


PROCEDURE  PW2ISHAX; 

VAR 

K,J :INTEGER; 

BEGIN  UPW2ISMAX*) 

K:=S-OFFSET; 

J:  =  S-OFFSET-l ; 

IF  J=0  THEN 

BECIN(xIFx) 

J  »  "  1  % 

PHONEC1D :=0; 

VERRORI II : = ’ XXXXX' 5 
END ( *IF« ) ; 

WHILE  ( VERRORI J]= ' INSRT' ) AND( J) (BEGINWORD+OFFSET) )  DO 
BEGIN  (*WHILE») 

J  5  =  J-l ? 

END  (*UHILE«); 

IF  PHONEIJ]  <>  UDATA.PHNEPIW3  THEN 
BEGIN  (*IF«> 

WDELETED  (W); 

END  (»IF«)} 

W :  =  W+ 1 ; 

WDELETED  (W>; 

PWORD:  =  PWORD+PHSCRIW  +  l  ]  5 
SCORECOUNT:sSCORECOUNT+l ; 

W:  =  W+1 ; 

SCORECK] ssPHSCRCW3 ; 

PHONECK3 : =WDATA .FHREPC W3 ; 

VERRORCKO  s  =  TERRORCW3 ; 

IF  VERRORIK35 ' INSRT'  THEN 
BEGIN(»IF«) 

INSRTSCOREs  =  INSRTSCORE+SCORECK3 ; 

END t  * IF« ) ; 


Ss*Sxl? 
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IF  ((S-0FFSET)>=70)ftND(I0UT.C0NTFLAi=l)  THEN 
BEGIN  ( *  I F * ) 

GETRECORD; 

END  <*IF*); 

END  (^PROCEDURE  PW2ISMAX* ) ; 


{XXX*X********#**»**»***#»##******X*»***X**X»*********X»****«**XX*»*»X 

PROCEDURE  WDELETED  DESCRIPTION 


THIS  PROCEDURE  COMPUTES  A  DELETION  SCORE  FOR  A  DELETED  WORD 
PHONEME  BASED  ON  THE  NUMBER  OF  TIMES  THE  PHONEME  WAS  DELETED  IN  THE 
TRAINING  SET, 


XXXXXXXXXXXXXXXXXXXXXXXKXXXXXXftXXXKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX} 


PROCEDURE  WDELETED; 

VAR 

ARGUMENT:REAL; 

BEGIN  (xWDELETEDx) 

DELCNT  s  =  DELCNT+l ; 

ARGUMENT: s (WSTATS.NDELt N3/WST ATS. NOSPOKE )/DELE; 
DELW:  =  POWER( ARGUMENT, DELF)  +  DELW; 

END  ^PROCEDURE  WDELETED*) ; 


{XXXXXXXXXXXXXXKXXXXXXXXXXXXXXKKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

PROCEDURE  GETRECORD  DESCRIPTION 

THIS  PROCEDURE  RETRIEVES  THE  NEXT  RECORD  OF  THE  SPEECH 
FILE,  AND  PLACES  IT  IN  THE  APROPRIATE  ARRAYS. 

XXXX#*XXXXXX*XX*XX#XX*XXXX#X*X**X****#XXXXXXXXXXXXXXX*X***XXXXXXXXXX*} 
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1  -  . 

1  AD-A124  851  ISOLATED  WORD 

1  FORCE  INST  OF 
i  ENGINEERING  G 
I  UNCLASSIFIED 

RECOGNITION  USING  FUZZV  SET  THEORV(U)  RIR  2/4  HII 1 
TECH  HRIGHT-PRTTERSON  RFB  OH  SCHOOL  OF  1 
J  HONTGOHERV  DEC  82  RFIT/GE/EE/82D-74  1 

F/G  5/8  NL  I 

!■ 

- 

i 


MICROCOPY  RESOLUTION  TEST  CHART 

NATIONAL  BUREAU  Of  STANDARDS-1963-A 


PROCEDURE  GETRECORD', 
VAR 

J  f I : INTEGER ; 

BEGIN  (*GETRECORD») 


SEEK (SPECH/l RECORD ) ; 

CET(SPECH); 

IOUT5=SPECHa? 

OFFSET:=OFFSET+RCRDLENGTH; 

{Assiqn  first  half  of  record  array  the  values 
stored  in  the  second  half  of  the  record.) 

FOR  J:*l  TO  RCRDLENGTH  DO 
BEGIN  («FOR«) 

FOR  1**1  TO  NOGUESSES  DO 
BEGIN  UFOR*) 

CHPHONECI | J] :  =  CHPHONE  C I , J+RCRDLENGTH3 ; 
CHVALUEC I , J] :  =  CHVALUEt I  ,J+RCRDLENGTH3 ; 
END  UFOR*)? 

SFACT0R!J3:*SF ACTOR! J+RCRDLENGTH3? 

IOUT . ISCOREIJ3 :=  SCQREU3? 

IOUT « IPHONE! J]:*PHONE!J3  ? 
IOUT.IVERROR!J3:=VERROR!J3; 
SCOREIJ3:sSCORE! J+RCRDLENGTH3  ? 

PHONECJ3  :  =  PHONE! J+RCRDLENGTH3  ? 

VERRORI J3  5*VERRQRI J+RCRDLENGTH3  ? 

END  UFOR*)? 

SEEK (SPECH;1 RECORD) ? 

SPECH‘:*IOUT? 

PUT(SPECH) ? 


{Assign  second  half  of  array  the  values  of  the 
next  record  of  the  speech  file.) 

I  RECORD :  =  I RECORD* 1 ; 

SEEKtSPECH ; ( IRECORD+1 ) ) ; 

GET(SPECH) ? 

IQUT:=SPECH‘? 

WITH  IOUT  DO 
BEGIN  UUITH*) 

FOR  JsM  TO  RCRDLENGTH  DO 
BEGIN  UFOR*) 

FOR  1**1  TO  NOGUESSES  DO 
BEGIN  UFOR*) 

CHPHONE! 1 ,J+RCRDLENGTH3 i = I CHPHOHE! I ,  J3 
CHVALUEII f J+RCRDLENGTH3 : = ICHVALUELI ,J3 
END  UFOR*)? 

SFACTORCJ+RCRDLENGTH3  **ISF ACTOR! J3  ? 

SCORE! J*RCRDLENGTH3;*ISCORE!J3? 

PHONE! J+RCRDLENGTH3 s *  IPHONE! J  3 ? 


VERRORIJ+RCRDLENGTH] :=IVERROR[J]; 
IF  ICHPHONEC 1 (>  0  THEN 
BEGIN  U1F*> 

ENDWORD:* J+OFFSET+RCRDLENGTH; 
END  (*IF»>; 

END  («FOR«); 

END  (*WITH»); 


END  ^PROCEDURE  GETRECORD#); 


BEGIN  <«SCOREWORD«) 

WRITELN( ' SCO REWORD' ) ; 

IRECORD:=BEGINREC; 


Unitialize  offset.) 

OFFSET:*®; 

WHILE  (OFFSET+RCRDLENGTH) (BEG1NWORD  DO 
BEGIN(*WHILE») 

OFFSET :*OFFSET+RCRDLENGTH; 
IRECORD:*IRECORD+l; 

END ( * WHI LE* ) ; 

RESET(SPECH); 

GET2REC0RDS; 

RESET ( WF ILE ) ; 

RESET(SFILE) ; 


{Initialize  all  prograa  variables.) 

TEPIPWORD:*'XXXXX' ; 

W0RDN0:*0; 

WHILE  (NOT  EOF ( WFILE) )  AND  (W0RD<  >TEHPW0RD)  DO 
BEGIN  («UHILE«) 

SEEK ( WFILE fUORDNO) ; 

GET(WFILE) ; 

WDATA:*WFILE*; 

GET(WFILE) ; 

TENPWORD :  =  WDATA. SPELLING; 

WORDNO:=WORDNO+1; 

END  UWHILE*); 

WORDNO:*WORDNO-l ; 

WORDLENGTH:*0; 

FLAG:*FALSE; 
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REPEAT 

UORDLENGTH :  =  UORDLENGTH  +  l ; 

IF  UDATA.PHREPCUORDLENGTH^O  THEM 
BEGIN ( *  I F* ) 

FLAG:=TRUE; 

UORDLENGTH := UORDLENGTH- 1; 

END ( *IF* ) ; 

IF  UORDLENGTH -HAXPHONES  THEN 
FLAG  r  =  TRUE ; 

UNTIL  FLAG; 

SEEX(SFILE ,UORDNO) ; 

CET(SFILE); 

USTATS:=SFILE* ; 

IF  USTATS<N0SP0XE:0  THEN 
BEGIN ( *IF* ) 

USTATS*  NOSPOKE :  =  1 5 
END ( *  IF* ) ; 

S:-BEGIHUORD; 
y  %-j  > 

PU0RD:-0.0; 

INSRTSCORE  :sO .0 ; 

SCORECOUNT :=0 ; 

DELU:=0.0; 

])C|  p mt  »  s  A • 

FOR  I s=l  TO  HAXVECTORS  DO 
BECINUFOR*) 

SCOREtn  s  =  0.0; 

PHONEin  :*0; 

VERR0RCI3: 5 ' XXXXX' ; 

END  («FOR«); 


(Deternine  word  phoneae  score  and  the  error  that 
occurs  for  each  vector  of  the  speech  file  based  on  the 
nuaber  of  word  phoneaes  and  vectors  reaaining  to  be 
exaained.} 

UHILE  < (UORDLENGTH-U) >  =  2)  AND  < <ENDU0RD-S) >  =  3)  DO 
BEGIN  («UHILE«) 

UGE2SGE3; 

END  ( » WHILE* ) ; 

UHILE  ((ENDUDRD  -S)>=0)  DO 
BEGIN  («UHILE*) 

IF  <<ENDU0RD-S)<-  2)AND< (UORDLENGTH-U)  >  0)  THEN 
BEGIN  (»IF») 

UHILE  < (ENDUORD-S) >s0)  AND  ((UORDLENGTH-U)  >  0)  DO 
BEGIN  (eUHILE*) 

IF  (UORDLENGTH-U) )s2  THEN 
UGE2SLE2 

ELSE  IF  (UORDLENGTH-U) = 1  THEN 
UEQ1SLE2; 

END  (*UHILE«); 

END  («IF») 

ELSE  IF  ( (ENDUORD-S) >s3) AND  ( (UORDLENGTH-U) = 1 )  THEN 
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BEGIN  (*ELSE*) 

HEQ1SGE3; 

ENDUELSE*) 

ELSE  IP  (U0RDLENGTH-U)=0  THEN 
BEGINUELSE*) 

WHILE  (EHDUORD-S) >=0  DO 
BEGIN(*UHILE*) 

VECTOR:^; 

FINDHAX(W, VECTOR, PR, ERROR); 

PHSCREU3 :sPR; 

TERRORCU] :sERROR ; 

PUISI1AX; 

END  UUH1LE*); 

END ( *ELSE* ) 

ELSE 

BEGIN(»ELSE«) 

WR I TELN ( ' ERROR1  IN  PROCEDURE  SCOREUORD.') 
ENDUELSE*) ; 

END  (»UHILE«); 

COHPUTESCORE; 

SEEK (SPECH, I RECORD) ; 

GET(SPECH); 

IOUT:=SPECH*; 

WITH  IOUT  DO 
BEGIN  UUITH*) 

FOR  Js*l  TO  RQRDLENGTH  DO 
BEGIN  (*FOR«) 

1SCORE! J] : ^SCOREIOl; 

IPHONE! J] :  =  PHONEIJ] ; 
IVERROR!J]:=VERROR!J]; 

END  <*FOR*); 

SEEK(SPECH,IRECORD) ; 

SPECH*  j*IOUT; 

Pl)T(SPECH) ; 

IF  IOUT,CONTFLAG=l  THEN 
BEGINUIF*) 

I  RECORD ;sIRECORD+l ; 

SEEK ( SPECH , I RECORD) ; 

GET(SPECH); 

IOUTs=SPECH*; 

FOR  J;:l  TO  RCRDLENGTH  DO 
BEGIN  (*FQR«) 

ISCOREC J] : -SCORE! J+RCRD LENGTH] ; 
IPHONEEJ] :sPHONE!J+RCRDLENGTH]; 
IVERROR! J] :  =  VERRORI J4RCRDLENGTH] ; 

END  <»FOR»>; 

SEEK (SPECH,  I  RECORD) ; 

SPECH*  s*IOUT ; 

PUT(SPECH) ; 

END ( * IF» ) ; 

END  <*HITH«); 

END (*PROCEDURE  SCOREUORD*) ; 


I*##***#*##****#**#***##*#*#**####*##***#*********#**#*#*#***#**###*# 


PROCEDURE  UPDATE  DESCRIPTION 


THIS  PROCEDURE  UPDATES  A  GIVEN  WORD'S  STATISTICS  BASED  OH  THE 
WORD  PHONEHE  CHOSEN  AND  THE  ERROR  THAT  OCCURRED  FOR  EACH  VECTOR  OF 
A  SPEECH  FILE,  WHICH  WAS  DETERMINED  BY  PROCEDURE  SCOREWORD. 

(NOTE  THAT  PROCEDURE  SCOREWORD  MUST  BE  EXECUTED  BEFORE  THIS  PROCEDURE 
IS  CALLED).  SINCE,  IN  CENERAL,  EACH  WORD  PHONEME  WILL  BE  CHOSEN 
FOR  MORE  THAN  ONE  VECTOR,  THE  SUBSTITUTION  AND  INSERTION  STATISTICS 
FOR  EACH  WORD  PHONEME  ARE  COMPUTED  BY  USING  THE  MAXIMUM 
SUBSTITION  AND  INSERTION  SCORES  PER  PHONEHE,  FOR  ALL  THE  VJECTORS 
ASSOCIATED  WITH  A  GIVEN  WORD  PHONEME. 

THE  FOLLOWING  VARIABLES  MUST  BE  INITIALIZED  PRIOR  TO  CALLING 
THIS  PROCEDURE:  WORDNO,  BEGINREC,  BEGINUORD. 


«#*##»*##*##»*##***###****###**##****#»*#*##*####*#*###«#«*#*####*#**} 


SEGMENT  PROCEDURE  UPDATE; 

VAR 

SUB,INS:ARRAYC1 . .NOPHONEHESI  OF  REAL; 
VSCORE, ARGUMENT :REAL; 

I , TEMP, I  NT  ,J:INTEGER; 

OFFSET, S,K:INTEGER; 

TEHPFLAG , FLAG: BOOLE AN; 


BEGIN(»UPDATE«) 

UR1 TELN ( ' UPDATE' ) ; 


{Get  word’s  data  records.) 

RESET(SFILE) ; 

SEEK ( SFILE , WORDNO ) ; 

GET(SFILE); 

USTATS:sSFILEA ; 

RESET(WFILE) ; 

SEE1UUF1LE, WORDNO); 

GET(WFILE) ; 

WDATA : -WFILE* ; 

RESET(SPECH) ; 

IRECORD  :-BEGINREC ; 

OFFSET : =0; 

«H1LE(0FFSET  +  RCRDLENCTHX  =  BEGINU0RD  DO 
BEGIN  UBHILE*) 


OFFSET:1 OFFSET+RCRDLENGTH; 

IRECORD :  =  IRECORD  + 1 ; 

ENDUWHILE#) ; 

SEEK (SPECH/ IRECORD) ; 

GET(SPECH) ; 

IOUT:=SPECHA ; 

WITH  WSTATS  DO 
BEGIN  ( *WITH* ) 

S :  =  BEG I N WORD ; 

NOSPQKE: -NOSPQXE+1 ; 

(Deteraine  the  plausibility  of  each  prototype  phoneae 
being  substituted  or  inserted  for  each  word  phoneae 
for  the  speech  file  being  exaained.) 

FOR  INT a  =  1  TO  WORDLENGTH  DO 
BEGIN  !*F0R») 

FLAG:sFALSE$ 

K  s  =  S - OFFSET  g 

FOR  I  :sl  TO  NOPHONEMES  DO 
BEGIN  !*F0R») 

SUBU1:=0.0; 

INSCI3  s-O.Og 
END  ( *FOR* >  5 
TEHPFLAG:*TRUE; 

WHILE  ( WDATA < PHREPCINT] :1 OUT < IPHONE! K3) AND (S<=ENDW0RD)  AND 
(TEHPFLAG=TRUE)  DO 
BEGIN  UWHILE*) 

WITH  IOUT  DO 
BEGIN  <«WITH«) 

IF  IVERRORCX]*" SUBST*  THEN 
BEGIN  («IF«) 

FLAG:=TRUE; 

FOR  1:»1  TO  NOGUESSES  DO 
BEGIN  <«F0R«> 

TEHP:=ICHPHONECI,K]; 

ARGUMENT :sISFACTORCKl/SFE? 

VSC0RE:M 1-ICH VALUE Cl ,K1/100)/CHVE; 
VSCORE:=!l-POWER(VSCQRE,CHVF))*(l -POWER! ARGUMENT, SFF)) 
IF  ( ( VSCQRE) )SUBCTEMP] )  THEN 
BEGIN  <«IF«) 

SUBCTEMP1 :-VSC0RE; 

END  (*IF«) \ 

END!»F0R«)? 

ENDUIF*) 

ELSE  IF  IVERR0RCK3= ‘ INSRT'  THEN 
BEGIN  <*ELSE*) 

FOR  I  :sl  TO  NOGUESSES  DO 
BEGIN  !*F0R«) 

TEHP  s-ICHPHONEEl ,K3 ; 

ARGUMENT :sISFACTORCK]/SFE; 

VSCORE:s(l-ICHVALUECI  ,K3/100)/CHVE: 

VSCORE:*! 1 -POWER! VSCORE  ,CHVF) )«( l -POWER! ARGUMENT, SFF) ) 
IF  ( (VSCORE) HNSCTEHP] >  THEN 
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BEGIN  (*IF*> 

INStTEHP] :sVSCORE; 

END  («IF«)i 
END  <*F0R*) ; 

END  UELSE*); 

IF  ( < (S-OFFSET ) =RCRDLENCTH) AND(CONTFLAC=l ) 
BEGIN  ( * I F* ) 


{Get  next  record  of  speech  f i I e • > 

GET(SPECH); 

IOUT:  =  SPECH* ; 

OFFSET :=OFFSET+RCRDLENGTH; 

END(*IF») 

ELSE  IF  (S- OFFSET )=RCRDLENGTH  THEN 
BEGINUELSE*) 

TEHPFLAG:sFALSE; 

S:=s-1 * 

ENDUELSE*) ; 

ENDUUITH*)? 

S:=S+1; 

K:sS-0FFSET ; 

END  OHHILE*) ; 

{Test  to  deteraine  if  a  deletion  error  occurred 
for  the  aord  phoneae  being  evaluated*} 

IF  FLAG=FALSE  THEN 
BEGIN  t » I F * ) 

{Update  deletion  statistic  for  aord  phoneae*} 

NDELtINT];=NDELCINT]+l; 

END  (»IF«); 


{Update  aord  phoneae's  insertion  and 
substitution  statistics*} 

FOR  J i = 1  TO  NOPHONENES  DO 
BEGIN  ( *  FOR* ) 

XSUBYCINT ,J]:=XSUBY[INT,J]+SUBCJ]; 
XINAFTZtINT ,J]:  =  XINAFTZtINT , Jl  +  INSCJ]  ; 
END ( «F0R* )  ? 

END  <«F0R«); 

END  (*HITH»); 

SEEK  (SFILE ,W0RDN0> ; 

SF1LE* :=USTATS; 

PUT(SFILE) ; 


END  (sPROCEDURE  UPDATE*) ; 


THEN 
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{**«»»*«*»*«**»***««*«»*»«»»««*»»«****»*«»»**»»««*«»««*«««»«««***«»*« 


PROCEDURE  ISORECOG  DESCRIPTION 


THIS  PROCEDURE  ACCEPTS  THE  NAME  OF  A  SPEECH  FILE  TO  BE  RECOGNIZED 
AND  COMPUTES  A  SCORE  FOR  EACH  WORD  IN  THE  VOCABULARY  BASED  ON  HOW 
NELL  THE  WORD  HATCHES  THE  GIVEN  FILE.  THE  UORD  WITH  THE  HIGHEST  SCORE 
IS  CHOSEN  AS  THE  UORD  THAT  WAS  SPOKEN.  THE  SPEECH  FILE  IS  LIMITED  TO 
ONE  UORD  SINCE  THIS  IS  AN  ISOLATED  UORD  RECOGNITION  ALGORITHM.  ONCE 
THE  RECOGNITION  SCORES  ARE  OBTAINED ,  THE  WORD'S  RECOGNITION 
STATISTICS  ARE  UPDATED  (SINCE  THE  ACTUAL  UORD  SPOKEN  IS  STORED  IN  THE 
SPEECH  FILE,  THERE  IS  NO  PROBLEM  WITH  UPDATING  THE  WRONG  STATISTICS). 
THE  RESULTS  ARE  STORED  IN  THE  FILE  SPECIFIED  BY  THE  USER.  THE  USER 
ALSO  HAS  THE  FOLLOWING  TWO  OPTIONS? 

1.  STORE  THE  SPEECH  FILE  IN  FILE  RECOG  TO  BE  USED  BY 
AN  AUTOMATED  PROCEDURE;  AND 

2.  OUTPUT  THE  ERRORS, PHONEMES  CHOSEN,  AND  THE  SCORES 
OF  THE  VECTORS  IN  THE  SPEECH  FILE  (OBTAINED  BY 
PROCEDURE  SCOREUORD)  FOR  EACH  UORD  IN  THE  VOCABULARY; 

AND  OUTPUT  THE  FINAL  VALUES  OF  PUORD,  SCORECOUNT, 

DELCNT ,  DELU,  AND  UORDSCORE  FOR  EACH  UORD. 


•«»«***«*««**«*«***««»«««*«««»««*«**»«*•*«»»«**•*«»*««««»**««*«*«*«•«} 


SEGMENT  PROCEDURE  ISORECOG; 

VAR 

FILENAME: STRING! 201 ; 

UNUMBER ,NOWORD:  INTEGER ; 
RESULTS  s  TEXT ; 

FLAG:BOOLEAN; 

CRECORD :  INTEGER ; 

INT,I ,J ,IOFFSET: INTEGER; 
TEHPWORD :STR I NGISPEL LENGTH] ; 
ANSUER:STRINGC3] ; 
NAME:STRINGC20] ; 

OUTERRQRS : STRING [3 ] ; 
HIGHSCORE  :REAL ; 

HIGHUORD: STRING! 20]; 
NEXTHIGHEST :  REAL ; 

BEGIN  («ISORECOG«) 

IN1TFILE; 

ENDFLAG:=FALSE; 


UR1 TELN ( ' DO  YOU  UANT  TO  STORE  THE  INPUT  SPEECH  FILE  FOR'); 
WRITELN! 'FUTURE  USE  BY  AN  AUTOMATED  PROCEDURE?  IF  YOU  DO') 
WRITELN! ' ENTER  "YES";  ELSE  ENTER  “NO".'); 

WRITE! ‘ ANSWER  *  '); 

READLNt ANSWER); 

WRITELN; 

IF  ANSWER  <)  ‘YES'  THEN 
BEGIN(*IF«) 

FILETITLE  <  SPECH , ' TEMPFILE' ) ; 

REWRITE(SPECH) ; 

END ( * IF»  > 

ELSE 

BEGINUELSE*) 

FILENAME :*CONCAT( SPEAKER/ RECOG' ) ; 

FILETITLE (SPECH  > FILENAME) ; 

ENDUELSE*); 

RESET(SPECH); 

WRITELN; 

{Get  nane  of  speech  file,  and  assure  that  eord 
is  in  the  current  vocabulary.} 

WRITELN (  INPUT  NANE  OF  SPEECHF1 LE ' ); 

WRITE! *  FILENAME  *  '); 

READLN(NAHE) ; 

WRITELN; 

FILETITLE(0UT2,NAME); 

RESET  (0UT2); 

WITH  IOUT  DO 
BEGIN  OWITHk) 

READLN(0UT2,  SPELL); 

FOR  Ii«  (LENGTH(SPELD)  TO  (SPELLENGTH-i)  DO 
BEGIN  <*FOR«) 

SPELL ; *CONCAT ( SPELL » *  '); 

END  (»FOR«); 

REPEAT 

RESET  (WFILE); 

FLAG:*FALSE; 

TEMPWORD?='  '; 

N0W0RD:-0; 

WHILE  (NOT  EOF (WFILE) ) AND (SPELL!  )TEMPW0RD)  DO 
BEGIN  (*UH1LE«) 

SEEKWFILE, NOWORD); 

GET(WFILE); 

WDATA ; =WFILE* ; 

GET(WFILE) ; 

TEHPWORD:: WDATA. SPELLING; 

NOWOftD :  =  N0W0RD+  1 
END  («WHILE«) ; 

WNUMBER : -NOWORD  - 1 ; 

IF  (SPELL! >TEHPW0RD)  THEN 
BEGIN  ( *  I F * ) 

WRITELN; 


HRITELN  ('NONE  OF  THE  UORDS  IN  THE  VOCABULARY')? 

HRITELN  ('AGREE  WITH  THE  SPELLING  GIVEN  IN  THE'); 

WRITE  ('FILE  "' ,NANE,'“.  THE  SPELLING  IN  ',NANE,'  IS')? 
HRITELN  ('  -  , SPELL, ')? 

HRITELN  ('DO  YOU  WANT  TO  CHANGE  THE  “',NAHE,'"  SPELLING?')? 
HRITELN  ('ENTER  “YES"  TO  CHANGE,  ELSE  ENTER  “NO”')? 

URITE  ('ANSWER  =  ')? 

READLN( ANSWER) ? 

IF  ANSHERO’YES'  THEN 
BEGIN  ( *  I F * ) 

FLAG:-TRUE? 

ENDFLAG:=TRUE? 

END  (»IF*) 

ELSE 

BEGIN  («ELSE«) 

HRITELN? 

HRITELN  ('ENTER  THE  CORRECT  SPELLING  OF  WORD.')? 

URITE  CHORD  =  ')? 

READLN  (SPELL)? 

HRITELN? 

FOR  I:*  ( LENGTH (SPELL > )  TO  (SPELLENGTH-1)  DO 
BEGIN  (*FOR») 

SPELL :-CONC AT (SPELL,*  ')? 

END  («FOR«)? 

END  (*ELSE*) ? 

END  («IF«) 

ELSE 

BEGIN  («ELSE<) 

FLAGj*TRUE? 

END ( »ELSE« )  ? 

UNTIL  FLAG? 

END  («UITH«) ? 

IF  ENDFLAG=FALSE  THEN 
BEGIN  («IF«) 

HIGHSCOREi-O.O? 

HIGHHORD: = ' XXXXX* ? 

NEXTHICHEST Js0.0? 


{Store  speech  file  in  either  a  temporary  file  or 
in  file  RECOG  in  the  foreat  required  by  procedure 
SCOREUORD. > 


STOREHORD? 

HRITELN! *  INPUT  NAHE  OF  OUTPUT  FILE')? 

WRITE! ' FILENAME  =  ')? 

READLN(FILENAHE) ? 

HRITELN? 

{Output  heading  and  recognition  inforiation  to 
output  fi)e«> 

FI  LET  I TLE< RESULTS ,  FILENAME ) ; 

REHRITE(RESULTS) ? 

HRITELN! RESULTS)? 
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<D 


WRITELN(RESULTS) 
URITELN(RESULTS) 
WRITELN( RESULTS ) 
HRITELH  <  RESULTS ) 
WRITELN<  RESULTS , 
yRITELN(RESULTS) 
WRITELN(RESULTS) 
HRITELNtRESULTS , 
«RITELN( RESULTS) 
UR1TELN<  RESULTS , 
WRITELN(RESULTS) 
WRITELN ( RESULTS ) 


'  :20, ' WORD  RECOGNITION  RESULTS')? 

ACTUAL  WORD  =  '  ,  10UT . SPELL)  ? 

SPEECH  FILENAME  -  ' ,NAHE) ? 


URI TELN ( ' DO  YOU  WANT  TO  OUTPUT  THE  ERROR  STATISTICS?'); 
WRITELNT ENTER  "YES"  TO  OUTPUT  THEM;  ELSE  ENTER  "NQ'V); 
WRITEt'ANSUER  =  '); 

READLN(OUTERRORS); 

WRITELN; 

RESET(WFILE); 

NOWORD:sO; 

WHILE  NOT  EOF(UFILE)  DO 
BEGIN ( *UHI LE* ) 

SEEK(WFILE,NOWQRD); 

GET(WFILE) ; 

WDATA:-WFILEA ; 

WORD  :  =  WD AT A. SPELLING; 

WRITELN(RESULTS); 

WRITELN(RESULTS) ; 

WRITELN(RESULTS); 

WRITELN(RESULTS) ; 

WRITELN; 

WRITELNC . '); 

WRITELN  ( '  WuriU  ATTEMPTED  *  ',WQRD); 

WRITELN (RESULTS , ' WORD  ATTEMPTED  =  ',WQRD>; 

WRITELN(RESULTS,' . '); 

WRITELN(RESULTS) ; 

GETWORDVARS; 

WRITELN (RESULTS , ' THE  WORD  FUZZY  VARIABLES  THAT  WERE  USED 
WRITELN(RESULTS) ; 

WRITELN (RESULTS , ' STHR  =  ' ,STHR , '  SUBE  *  ',SUBE,'  SUBF 
WRITELN(RESULTS,‘ INSE  =  ',INSE,'  INSF  =  ',INSF); 
WRITELNCRESULTS »' DELE  =  ',DELE,'  DELF  *  ',DELF,'  DELG 
WRITELN (RESULTS  , ' DCNE  =  ',DCNE,'  DCNF  =  ’,DCNF,'  DCNG  =  ' 
WRITELNIRESULTS , ' SFE  =  ',SFE,'  SFF  =  ’,SFF); 

WRITELN(RESULTS,'CHVE  =  ' ,CHVE,'  CHVF  =  '  ,CHUF) ; 

WRITELN (RESULTS ,' STATE5  ', STATE,'  STATF5  '  ,STATF,‘  ST ATC  = 
WRITELN(RESULTS, ' THR1E=  ' ,THR1E,‘  THR1F5  *  ,THR1F) ; 
WRITELN ( RESULTS, 'THR2E5  ' ,THR2E,'  THR2F5  ' ,THR2F) ; 
WRITELN(RESULTS); 

SCOREWORD; 

IF  NOWORD:WNUMBER  THEN 
BEGIN( «IF« ) 

SEEK(SFILE,  WNUMBER); 

GET(SFILE) ; 


FOLLOW') 

' , SUBF ) 

'  ,DELG) 
DCNG); 

‘  ,STATG) 
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USTATSi sSFILEA ; 

WSTATS.T0TALSC0RE:=WSTA7S. TOTALSCQRE+WQRDSCORE; 
SEEX(SFILE,WNUHBER); 

SFILE* : =WSTATS; 

PUT (SFILE) ; 

ENDUIF#); 

WRI TELN ( ' WORD  SCORE  =  ' ,WORDSCORE) ; 

WRITELNC . * ); 

WRITELN(RESULTS,'WORD  SCORE  =  '  ,WORDSCORE) ; 

WRITELN(RESULTS) ; 

IF  OUTERRQRS5 ' YES'  THEN 
BEGIN ( »IF* ) 

liRITELN (RESULTS ,'PH OR D  =  ',PUORD,'  SCORECOUNT  =  ' ,SCOKECOUNT) 
WRI TELN (RESULTS , ‘ DELW  =  1  ,DELW,‘  DELCNT  =  DELCNT); 

END ( » I F* ) ; 

{Deteraine  highest  scoring  tiord,  and  score 
for  the  next  highest  scoring  word.) 

IF  (UORDSCORE( : HI GHSCORE) AND (UORDSCORE)NEXTHI CHEST)  THEN 
BEGIN(*IF«) 

NEXTHICHEST:=WORDSCORE; 

ENDUIF*); 

IF  ( (WORDSCORE) >HIGHSCORE)  THEN 
BEGIN(*IF*) 

NEXTHIGHEST^HIGHSCORE; 

HIGHSCORE: =WORDSCQRE; 

HIGHWORD:-WORD; 

ENDUIF*); 

CRECORD :  =  BEGINREC; 

FLAG:=TRUE; 

I0FFSET:=0; 

WHILE  ((FLAG-TRUE)OR(IOUT«CONTFLAG=1))DO 
BEGIN  (kUHILEk) 

FLAG:=FALSE; 

SEEK(SPECH, CRECORD); 

GET(SPECH) ; 

IOUTs=SPECHA; 

CRECORD CRE CORD+ 1 ; 

I  s  =  0; 

IF  OUTERRORS='YES'  THEN 
BEGIN ( *IF* ) 

{Output  vector  error  and  recognition 
inforaation  to  output  file.} 

WITH  IOUT  DO 
BEGIN  (*WITH*> 

WHILE  IfRCRDLENGTH  DO 
BEGIN  UWHILE*) 

WRITELN(RESULTS) ; 

WRITE (RESULTS > ’ VECTOR  *  '); 

FOR  J s  =  1  TO  10  DO 
BEGIN(*F0R») 


INT:=J+I+IOFFSET; 

WRITE ( RESULTS ) I NT  :9) ; 

END  (»FOR») ; 

WRITELN(RESULTS) ; 

WRITEtRESULTS, 'PHONEME  CHOSEN  =  ')? 

FOR  Js=l  TO  10  DO 
BEGIN (*FOR* ) 

WRITE (RESULTS) IPHONE! J+I):9) $ 
ENDUFOR*); 

WRITELN ( RESULTS ) ; 

WRITE (RESULTS  t ' SCORE  =  ’); 

FOR  J : = 1  TO  10  DO 

BEGINUFOR*) 

WRITE ( RESULTS) I SCOREtJ+I 1:9);  . 

END ( «FOR* ) ; 

WRITELN ( RESULTS ) ; 

WRITE (RESULTS  > ‘ ERROR  *  ') 

FOR  J ;= 1  TO  10  DO 
BEGIN ( *FOR» ) 

WRITE(RESULTS)IVERRORIJ+I],'  '  :4); 
END(#FOR#> ; 

WRITELN (RESULTS) ? 

WRITELN(RESULTS) i 
WRITELN(RESULTS) ; 

I:sI+10; 

END  (*WHILE»); 

END  («WITH*>i 
END(*IF») ; 

IOFFSET:=IOFFSET+RCRDLENGTH; 

END  (*UHILE>) ; 

SEEK(WFILE,NOWORD); 

GET(WFILE) ; 

GET(WFILE); 

N0W0RD:  =  N0W0RD  +  1 ; 

END  (*WHILE»)? 


{Update  word's  recognition  statistics.) 

SEEIUWFILE , WNUttBER >  $ 

GET(WFILE); 

WDATA i sWFILE* ; 

SEEK(SFILE ,WNUHBER) ; 

GET(SFILE) ; 

WSTATS JsSFILE* ; 

WSTATS.  NO  ATTEMPT :  *WST  ATS.  NOATTEPIPT  +  1  ? 

IF  HIGHWORD=WDATA, SPELLING  THEN 
BEGIN(«IF«) 

WSTATS. NOCORRECT:  =  WST ATS. NOCORRECT  + 1 ; 

WSTATS.TOTDIFF  s  =  WST ATS . TOTD I F F  +  ( H I GHSCORE -HEXTH I GHEST ) 
IF  (HIGHSCORE-NEXTHIGHEST) (WSTATS .MI NDIFF  THEN 
BEGINUIF*) 

WSTATS. MINDIFF:*HIGHSCORE-NEXTHIGHEST; 

END  (*IF«) ; 

END(*IF»); 
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SEEK ( SFILE ,WNUWBER) ; 

SFILE*  s  =  USTATS ; 

PUT(SFILE); 

WRITELN; 

WRITELH ; 

UR  I T  ELM ( ' THE  RECOGNITION  RESULTS  ARE  IN  FILE  FILENAME, 

END(*IF») ; 

CLOSE (RESULTS ) ; 

END  ^PROCEDURE  ISORECOG*); 


{**#******#*#*#*#*#**»*##*##**§*#»###*##****##*#*»**#**#****#*»****#* 


PROCEDURE  CHNGEREP  DESCRIPTION 


THIS  PROCEDURE  PROVIDES  THE  USER  WITH  THE  ABILITY  TO 
MANUALLY  CHANGE  THE  PHONEME  REPRESENTATION  OF  A  SPECIFIED  WORD. 

AFTER  THE  NEW  REPRESENTATION  IS  ENTERED,  THIS  PROCEDURE  WILL  RE¬ 
INITIALIZE  THE  WORD'S  STATISTICS  USING  THE  WORD'S  TRAINING  SET  STORED 
IN  FILE  SPECH.  IF  THE  USER  DOES  NOT  WANT  THE  STATISTICS  TO  BE 
INITIALIZED  (THE  USER  MAY  WANT  TO  ALTER  THE  REPRESENTATION 
OF  SEVERAL  WORDS,  AND  THEN  EXECUTE  THE  AUTOMATED  INITSTAT  ROUTINE)  THIS 
PROCEDURE  CAN  BE  ABORTED  WHEN  "INITSTAT"  APPEARS  ON  THE  TERMINAL  SCREEN. 


««*«««««««*«««««»»**««*««»««««««• a##**#****#****#***#*#*#*#*#***#*#*#) 


SEGMENT  PROCEDURE  CHNGEREP; 

VAR 

U:INTEGER; 

I  {INTEGER; 

NOWORD: INTEGER; 

TEHPW0RD:STR1NGCSPELLENGTH] ; 

ANS: INTEGER; 

FLAGsBOOLEAN; 

BEGIN ( *INI Tl* ) 

INITFILE; 

ENDFLAG :*FALSE; 

URITELN; 

{Get  word  to  be  operated  on,  determine  if  it  is  in 
the  current  vocabulary,  and  set  variable  NOWORD 
to  indicate  the  word.) 

WRITELN  ('INPUT  EXACT  SPELLING  OF  WORD  WHOSE  REPRESENTATION  '); 
URITELN < ' IS  TO  BE  CHANGED.'); 

WRITE  (‘WORD  *  ’); 

READLN  (WORD); 

FOR  I :s (LENGTH(WORD) )  TO  (SPELLENGTH-l)  DO 
BEG1N(KF0R«) 

WORD : *CONCAT ( WORD , *  '); 

ENDOFOR*); 

WRITELN  ; 

RESET ( SFILE ) ; 
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RESET  (WFILE) ; 

TEHPWORD:5'  '; 

N0W0RD:-0; 

WHILE  (NOT  EOF (WFILE) ) AND ( UOF(D <  >TEHF'WDRD)  DO 
BEGIN  ( #WHILE* ) 

SEEK (WFILE , NOWORD) ; 

GET (WFILE); 

WDATA : = WFI LE A ; 

GET (UFILE) ; 

TEMPWORD:=WDATA. SPELLING; 

NOWORD:=NOWORD+  1 
END  < *WH I LE* ) ; 

NOWORD:=NOWQRD-1 ; 

IF  (WORDOTEF1PWORD)  THEN 
BEGIN  (#IF») 

WRITELN; 

WRITELN  ( ‘ WORD  DOES  NOT  EXIST.  *); 

ENDFLAG:=TRUE; 

END  (*IF*) 

ELSE 

BEGIN  (*ELSE*) 

{Output  word's  current  phoneae  representation;  and 
get  the  new  one.} 

WITH  WDATA  DO. 

BEGIN  (»WITH») 

SPELLING:=WORD; 

WRITELN; 

WRITELNt 'THE  CURRENT  PHONEME  REPRESENTATION  FOLLOWS:'); 

FOR  1 5  =  1  TO  SPELLENGTH  DO 
BEGINUFOR*) 

WRITE(WDATA.PHREP[I3:3) ; 

END(*FOR») ; 

WRITELN; 

WRITELN; 

WRITELN; 

WRITELN  CENTER  THE  NUMERIC  VALUE  (01  TO  71)  FOR  ’); 
WRITELN  ('EACH  PHONEME  IN  THE  WORD;  UP  TO  A  MAXIMUM  OF  '); 
WRITE  (MAXPHONES); 

WRITELN  ('  PHONEMES.  AFTER  ALL  WORD  PHONEMES  HAVE  BEEN  ') 
WRITELN  CENTERED,  ENTER  "00".  '); 

U :  =  1 ; 

ANS  5  =  71 5 

WHILE  (W<sNAXPHONES)AND(ANS<  >00)  DO 
BEGIN  <«WHILE«) 

WRITELN; 

WRITE  ('WORD  PHONEME  ',W:2,'  =  '); 

READLN  (ANS); 

IF  (00<  =  ANS)AND( ANS<  =71 )  THEN 
BEGIN  <«1F«) 

PHREPCW3  s  =  ANS; 

W :  =W+1 
END  («IF«) 
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ELSE 

BEGIN  (*ELSE») 

UR1TELN; 

WRITELN  ( ' PHONEME  NOT  VALID,  REENTER' 1 ; 
END  <*ELSE*>; 

END  (*UHILE») ; 


IF  PHREPC13  =  00  THEN 
BEGIN  t » IF* ) 

URITELNt  *  NO  DATA  RECORDED  (WORD  PHONEflE  1  * 
ENDFLAG;-TRUE 
END ( * I F* ) 

ELSE 

BEGIN  (#ELS£«) 

IF  U<HAXPHONES  THEN 
BEGIN  ( * I F* ) 

FOR  J:*  U  TO  HAXPHONES  DO 
BEGIN  (*FOR») 

PHREPC J3  s  =  00 
END  <*FOR*>; 

END  (*IF») ; 

SEEK ( WF I LE , NOWORD) ; 

WFILE* :=WDATA; 

PUT  (HFILE) ; 

{Initialize  the  word's  statistics  using  the 
overall  fuzzy  variables  and  the  word's 
training  files  stored  in  file  SPECH.J 

GETFUZZYVARS; 

INITSTAT(NOHORD) ; 

END  <«ELSE»)* 

END  ( *UITH» ) ; 

END  («ELSE«); 


ENDUPROCEDURE  CHNGEREP* ) 


(#####»*##«»**#*#********##***##*#*****##**###*##*#«***#*###**###*### 


PROCEDURE  CHNCEFUZ  DESCRIPTION 


THIS  PROCEDURE  ENABLES  THE  USER  TO  CHANCE  EITHER  THE  OVERALL 
FUZZY  VARIABLES  STORED  IN  FILE  FUZZYVAR,  OR  A  SPECIFIED  WORD'S 
FUZZY  VARIABLES. 


«*X»«*»*«*««*««***««»***K**«*»*K »»««**»««*«*»*«*«***«»«*««*«««***«»«*»} 


SEGMENT  PROCEDURE  CHNCEFUZ; 

VAR 

TEHPWORD:STRING[SPELLENGTHI; 
ANSWER SSTRINGI3I ; 


PROCEDURE  CHWORVAR; 

VAR 

IsINTEGER; 

BEGIN (*CHWOR VAR*) 

IN1TF1LE; 

WRITELN; 

WRITELN  ('INPUT  EXACT  SPELLING  OF  WORD  WHOSE  VARIABLES  '); 
WRITELNt 'ARE  TO  BE  CHANGED.'); 

WRITE  ('WORD  *  '); 

READLN  (WORD); 

FOR  1 : 1 (LENGTH( WORD ) )  TO  (SPELLENGTH-1 )  DO 
BEGIN(*FOR») 

WQRDs=CQNCAT(WORD,'  '); 

END(*FOR*) ; 

WRITELN  ; 

RESET (SPILE); 

RESET  (WFILE); 

TEHPWORDs*'  '; 

WORDNOssO; 

WHILE  (NOT  EOF (WFILE) )AND(WORD< )TEHPWORD)  DO 
BEGIN  («WHILE») 

SEEK (WFILE, WORDNO) ; 

GET(WFILE) ; 
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UDATAs  =  WFILE* ; 

GET ( WF I LE ) ; 

TEfIPWORD :  =  WDATA .  SPELLING  ^ 
UORDNO:*UORDNO+  1 
END  UUHILE*); 

UORDNO:=UORDNO-l; 

IF  (UORD<  > TEFIPWORD )  THEN 
BEGIN  (*IF») 

URITELN; 

URITELN  ('WORD  DOES  NOT  EXIST.  ' ); 
END  (»IF») 

ELSE 

BEGIN  UELSE*) 

WITH  UDATA  DO 
BEG1NUU1TH*) 

REPEAT 

URITELN; 


UR1TELN( ' THE  CURRENT  FUZZY  VARIABLES  FOLLOUs  '); 
URITELN; 


URITELNCSTHR  : 

'  ,USTHR,' 

SUBE  * 

‘  ,USUBE t '  SUBF  *  '  ;USUBF) ; 

URITELNC  INSE  : 

1  >UINSE t ' 

INSF  = 

' ,UINSF); 

URITELNCDELE  : 

' ,UDELE , ‘ 

DELF  * 

',UDELF,'  DELG  =  '  ,UDELG) ; 

URITELNCDCNE  * 

‘  ,UDCNE,‘ 

DCNF  * 

' > WDCNF t '  DCNG  *  1  ,WDCNG); 

URITELNCSFE  : 

'  ,USFE,' 

SFF  * 

jUSFF) ; 

URITELN ( ’CHVE  * 

'  ;UCHVE  t ' 

CHVF  * 

'  ,UCHVF); 

URITELNC  STATE* 

'jUSTATE,' 

STATF 

',USTATF,*  STATG*  ' ,USTATG 

HRITELNCTHRiE* 

1  ,UTHR1E,' 

THR1F 

'  ,UTHR1F); 

URITELNt ' THR2E* 

',UTHR2E,' 

THR2F 

*  ,UTHR2F); 

URITELN; 

URITELNTDO  YOU  WISH  TO  CHANGE  ANY  FUZZY  VARIABLES?  '); 
URITELN( 'ENTER  "YES"  TO  CHANGE;  ELSE  ENTER  "NO".'); 
URITE( ' ANSUER  *  '); 

READLNt ANSWER); 

URITELN; 

IF  ANSUER:  'YES'  THEN 
BEGIN(»IF») 

URITECOLD  STHR  *  ’ ,USTHR,'  NEU  STHR  *  '); 
READLN(USTHR); 

WRITE  ( '  OLD  SUBE  --  ',USUBE,'  NEU  SUBE  *  '); 
READLN(USUBE) ; 

URI TE < ' OLD  SUBF  *  ',USU3F,'  NEU  SUBF  *  '); 
READLN(USUBF); 

URITECOLD  INSE  *  ',UINSE,'  NEU  INSE  »  '); 
READLN(UINSE) ; 

URITECOLD  INSF  *  '  ,WINSF , '  NEU  INSF  *  '); 
READLN(UINSF) ; 

URITECOLD  DELE  *  '  ,WDELE , '  NEU  DELE  :  '); 
READLN(UDELE) ; 

UR1TEC OLD  DELF  :  ',UDELF,'  NEU  DELF  =  '); 
READLN(UDELF) ; 

URITECOLD  DELG  =  ',UDELG,'  NEU  DELG  =  '); 
READLN(UDELG) ; 

URITECOLD  DCNE  =  ',UDCNE,'  NEU  DCNE  :  '); 
READLN(UDCNE) ; 
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WRITE( 'OLD  DCNF  5  ',WDCNF,'  NEW  DCNF  *  '); 
READLN(WDCNF) ; 

UR  I TE  < ' OLD  DCNG  5  '  ,  WDCNQ , '  NEW  DCNG  =  ' ); 
READLN(WDCNG) ; 

WRITE( ' OLD  SFE  *  ' ,WSFE  NEW  SFE  -  '); 
READLN(USFE) ; 

UR1TE ( ' OLD  SFF  5  ',USFF  ,'  NEW  SFF  5  '); 
READLN(WSFF) ; 

UR I TE ( ' OLD  CHVE  5  ',WCHVE,‘  NEW  CHVE  5  '); 
READLN(WCHVE) ; 

WRITE( ' OLD  CHVF  5  ',WCHVF,'  NEW  CHVF  5  '); 
READLN(WCHVF) ; 

WRITE( 'OLD  STATE5  ',WSTATE/  NEW  STATE5  '); 
READLN(WSTATE); 

UR I TE < 'OLD  ST ATF 5  ',WSTATF,'  NEW  STATF5  '); 
READLN(WSTATF); 

WRITEC OLD  STATC5  ',WSTATG,'  NEW  STATG5  '  >; 
READLN(WSTATG) ; 

WRI TE ( ' OLD  THR1E5  ',WTHR1E,'  NEW  THR1E5  '); 
READLN(WTHRIE) ; 

URITE ( ' OLD  THR1F5  ',WTHR1F,'  NEW  THR1F5  '); 
READLN(WTHRIF) ; 

WRITE( ' OLD  THR2E5  ',UTHR2E,'  NEW  THR2E5  '); 
READLNIWTHR2E); 

WRITE('OLD  THR2F5  ',WTHR2F,'  NEW  THR2F5  '); 
READLN( WTHR2F) ; 

WRITELN; 

ENDUIF*); 

UNTIL  (ANSWERO'YES'); 

* 

SEEX ( WF I LE ;WORDNO> ; 

WFILE* :5WDATA; 

PUT(WFILE); 

END ( « W I TH* ) ; 

END(*ELSE») ; 

END(«PROCEDURE  CHWORVAR*); 


BEG1NOCHNGEFUZO ; 

WR I TELN  ( '  DO  YOU  WANT  TO  CHANGE  A  WORD'S  FUZZY  VARIABLES/); 
UR  ITELN  ( *  OR  THE  OVERALL  FUZZY  VARIABLES?.  ENTER  '’WORD"  TO') 
WR1TELNC  CHANGE  A  WORD'S  VARIABLES.’); 

WRITEC  ANSWER  5  ’); 

READLN( ANSWER) ; 

WRITELN; 

IF  ANSWERO'WOR'  THEN 
BEGIN(«IF») 

RESET< FUZZYVAR ) ; 


IF  NOT  EOF(FUZZYVAR)  THEN 
BEGIN<#IF») 

GETFUZZYVARS; 

END ( *  I F* ) 

ELSE 

BEGIN<*ELSE») ; 

STHR  2  =  1? 

SUBE : = 1 ; 

SUBF*.  =  1; 

I  USE :  =  1 ; 

I NSF  s  =  t  ? 

DELE :  =  1 ; 

DELF:=1; 

DELG :  =  1 ; 

DCNEi5l; 

DCNF:51 ; 

DCNG:-1 ; 

SFE  s  =  1 5 
SFF  s  =  1 ; 

CHVE:=1; 

CHVF:=1; 

STATE : = 1 ? 

STATFi-1; 

STATC  s  =  l; 

THRlEs  =  1 ; 

THR1F:=1 ; 

THR2E:sl ; 

THR2F  s  =  1 ; 

ENDUELSE*); 

REPEAT 

URITELN; 

URITELNt ' THE  CURRENT  FUZZY  VARIABLES  FOLLOU:  '); 
URITELN; 

URITELN ( 'STHR  5  '  ,STHR.'  SUBE  5  ',SUBE,'  SUBF  5  '  ,SU 
URITELN ( ' INSE  5  ',INSE,'  INSF  *  ',INSF>; 

URITELN ( ' DELE  5  ',DELE,'  DELF  *  ',DELF,'  DELG  *  * ,DE 
URITELN ( ' DCNE  *  *,DCNE,'  DCNF  *  ',DCNF,’  DCNG  *  *  ,DC 
WRITELNCSFE  5  ',SFE,'  SFF  5  ',SFF>; 

UR  I TELN ( ' CHVE  5  *,CHVE,'  CHVF  -  ',CHVF>; 

HRITELN( 'STATE5  ', STATE,'  ST ATF  =  ',STATF,'  STATG5  ', 
URITELN< '  THR1E5  ',THR1E,'  THR1F5  ’  ,THRIF> ; 

UR1TELN( '  THR2E=  ',THR2E,'  THR2F-  '  ,THR2F) ; 

URITELN; 

URITELNt'DO  YOU  UISH  TO  CHANGE  ANY  FUZZY  VARIABLES?  ') 
URITELNC ENTER  "YES"  TO  CHANGE;  ELSE  ENTER  "NO”.'); 

UR ITE ( ' ANSWER  =  '); 

READLN(ANSUER) ; 

URITELN; 

IF  ANSWER5  'YES'  THEN 
BEGIN(*IF«) 

UR  1 TE < 'OLD  STHR  5  ',STHR,'  NEU  STHR  5  '); 
REABLN(STHR); 

URITE ( '  OLD  SUBE  5  ',SUBE,'  NEU  SUBE  5  '); 
READLN(SUBE)  i 


WRITE  < '  OLD  SliBF  =  ',SUBF,'  NEW  SUBF  =  *); 
READLN(SUBF) ; 

WRITE!  *  OLD  INSE  =  ',INS£,'  NEW  INSE  =  '); 
READLH ( INSE ) ; 

WRITE< '  OLD  INSF  =  ',INSF,'  NEW  INSF  *  '); 
READLN(INSF); 

WRITE( ' OLD  DELE  =  ',DELE,'  NEW  DELE  -  '); 
READLN(DELE) ; 

WRITE  t '  OLD  DELF  =  *  ,DELF,‘  NEW  DELF  *  ' )  ♦, 
READLN(DELF) ; 

WRITE ( ' OLD  DELC  =  '  ,DELG , '  NEW  DELC  =  '); 
READLN(DELG) ; 

WRITE ( ' OLD  DONE  *  ',DCNE,'  NEW  DONE  =  '); 
READLN(DCNE) ; 

WRITE C ' OLD  DCNF  =  ',DCNF,‘  NEW  DCNF  =  '); 
READLN(DCNF) ; 

WRITE!' OLD  DCNG  =  '  ,DCNG,'  NEW  DCNG  =  ')? 
READLN(DCNG); 

WRITE ( ' OLD  SFE  *  '  ,SFE  ,'  NEW  SFE  *  '); 
READLN(SFE) ; 

WRITE! ' OLD  SFF  *  ' ,SFF  ,'  NEW  SFF  =  '); 
READLN(SFF) ; 

WRITE ( ' OLD  CHVE  *  ',CHVE,'  NEW  CHVE  »  '); 
READLN(CHVE) ; 

WRITE! 'OLD  CHVF  =  ',CHVF,'  NEW  CHVF  =  '); 
READLN(CHVF); 

WRITECOLD  STATE1  '/STATE,'  NEW  STATE-  ') 
READLN(STATE); 

WRITE! ' OLD  STATF--  '  ,STATF,'  NEW  STATF5  ') 
READLN(STATF) ; 

WRITE! ' OLD  STATG*  ' >STATG , '  NEW  STATG=  ’) 
READLN(STATG); 

WRITECOLD  THR1E=  '  ,THR1E,'  NEW  THR1E*  ') 
READLN(THRIE) ; 

WRITECOLD  THRIFT  '  ,THR1F,'  NEW  THR1F=  ') 
READLN(THRIF) ; 

WRITE! ' OLD  THR2E=  '  ,THR2E,'  NEW  THR2E5  ') 
READLN(THR2E>', 

WRITECOLD  THR2F  =  '  ,THR2F,'  NEW  THR2F  =  ') 
READLH ! THR2F ) ; 

WRITELN', 

END ( *1 F* ) ; 

UNTIL  (ANSWER! >'YES' ) ? 


REWRITE(FUZZYVAR) $ 

WRITELN ( FUZZ YVAR,  STHR, SUBE,  SUBF ) ; 
.WRITEIN!  FUZZY  VAR  ,INSE  ,INSF>  ; 

WRITELN (FUZZY VAR, DELE, DELF, DELG ) ; 
WRITELN(FUZZYVAR,DCNE, DCNF, DCNG)', 
WRITELN! FUZZ YVAR, SFE, SFF ) ; 

WRITELN ! FUZZYVAR  f CHVE  ,CHVF>; 
WRITELN(FUZZYVAR,STATE,STATF,STATG); 


WRITELNt FUZZY VAR ,THR1E,THR1F) ; 
WRITELN<FUZZYVAR,THR2£,THR2F); 
CLOSE(FUZZYVAR); 

END ( « I F* ) 

ELSE 

8EGINUELSE*) 

CHWORVAR; 

ENDUELSE*)? 


END(*PROCEDURE  CHNGEFUZ*) ; 


{***»****«****»**#**************#«**********#**** *#***#*»*###*###### 


PROCEDURE  CHNGEOPT  DESCRIPTION 


THIS  PROCEDURE  PROVIDES  THE  USER  WITH  THE  ABILITY  TO  CHANGE 
EITHER  THE  INITIAL  0PT1PIUN  FUZZY  VARIABLE  VALUES  AND  LIMITS  STORED 
IN  FILE  OPTFUZZY ,  OR  A  SPECIFIED  WORD’S  OPTIHUN  FUZZY  VARIABLE 
VALUES  AND  LIMITS  STORED  IN  FILE  FZOPT. 


#*#*#**»**#*»*#******«###*»#«**********#*#**##**#»#»*#*#**#*#**##***} 


SEGMENT  PROCEDURE  CHNGEOPT? 

VAR 

WNUHB,I ,J:INTEGER; 
FNAMEsSTRINCCS] ; 
ANSUER;STRINGC3] ; 
TEUPWORDjSTRINGISPELLENGTH] ? 


BEGIN(*CHNGEOPT»)  ? 

WRITELN( ' DO  YOU  WANT  TO  CHANGE  A  WORD'S  FUZZY  VARIABLES,'); 
WR1TCLN < ' OR  THE  OVERALL  FUZZY  VARIABLES?,  ENTER  "WOFV-  TO')? 
WR1TELN< ' CHANGE  A  WORD'S  VARIABLES,'); 

WRITEC ANSWER  *  '); 


READLN (ANSWER ) ; 

URITELN; 

IP  ANSWER < >'UOR'  THEN 
BEGINUIF*) 

RESET(OPTFUZZY) ; 

IF  NOT  EOF(OPTFUZZY)  THEN 
BEGIN(*IF») 

WITH  FLIHIT  DO 
BEGIN ( «WITH* ) 

FOR  1 5  =  1  TO  NOFUZZYVARS  DO 
BEGINUFOR*) 

READLN( OPTFUZZY  #FUZVARE 13 ,«AXLIHITC 13  ^HINLIIIITLI]  ,FUZNAHEC I D ) 
ENDUFOR*); 

RECSC0RE:=0; 


ENDUUITH*); 


ENDUIF*) 

ELSE 

BEGINUELSE*); 
WITH  FLIHIT  DO 
BEGIN(*UITH«) 
FUZN AHE [13: 

=' STHR  •; 

FUZNAHECZ3 : 

= '  SUBE 

FUZNANE C 3 3 : 

‘ SUBF 

FUZNAHEC43 : 

'INSE 

FUZNAHEC53 : 

' INSF  '  ; 

FUZNAHEC6] : 

'DELE  '? 

FUZNAHEC7] : 

'DELF 

FUZN AHE  C  83 : 

' DELG  '$ 

FUZNAHEC93 : 

'DONE  ' ; 

FUZNAHEC 103 

' '  DCNF  ' 

FUZN AHEt 113 

=  ’  DCNG  ' 

FUZNAHEC 123 

= '  SFE  ' 

FUZNAHEC 133 

s ’ SFF  ' 

FUZNAHEC 143 

s '  CHVE  ' 

FUZNAHEC153 

s '  CHVF  ' 

FUZNAHE  1 63 

ESTATE' 

FUZN AHE  C 1 7  3 

=  ‘  ST  ATF ' 

FUZNAHEC 183 

~  ' STATG' 

FUZNAREC193 

= ' THR1E’ 

FUZNAHEC203 

= ' THR1F' 

FUZNAHEC213 

S‘THR2E’ 

FUZNAHE  C  2  2  3 

= ' THR2F ' 

FOR  I s  =  1  TO  NOFUZZYVARS  DO 
BEGIN  <  »FOR* ) 

FUZV AR [ 1 3  s  =  1 ; 
HAXLIHITCI3  :  =  10; 
HINLIHITCI3:=0; 
END(«FOR«)*, 

END(*HITH«); 

ENB< *ELSE* ) ; 

WITH  FLIHIT  DO 
BEGINUHITH*) 

REPEAT 


WRITELN; 


UR I TELN < 'THE  CURRENT  FUZZY  VARIABLES  FOLLOW:  '); 

URITELN; 

UR  I TELN  <  *  FUZZY  INITIAL  MAXLII1IT  NINLIHIT' ) ; 
URITELN ( *  VARIABLE  VALUE*); 

URITELN!* .  .  . '); 

URITELN; 

FOR  I:=l  TO  NOFUZZYVARS  DO 
BEGIN ( «FOR«  > 

WRITELNtFUZNAHECn  :fl,FUZVARCI]  :10:4,PIAXLIHIT[I]  :14:4  ^MINLIHITCII  :14:4) 
END ( *FOR* ) ; 

URITELN; 

URITELN; 

URITELN; 

UR I TELN ( ' DO  YOU  WISH  TO  CHANCE  ANY  FUZZY  VARIABLES?  '); 

URITELN ( ' ENTER  "YES"  TO  CHANGE;  ELSE  ENTER  “NO".*); 

URITET  ANSWER  =  '); 

READLNt ANSWER) ; 

URITELN; 

IF  ANSWER^  *  YES*  THEN 
BEGI N ( *  I F» ) 

URITELN < ' ENTER  THE  NAHE  OF  THE  VARIABLE  TO  BE  CHANCED'); 
URITET  VARIABLE  NAHE  *  *>; 

READLN(FNAHE) ; 

IF  LENGTH!FNAHE)!5  THEN 
BEGIN(«IF«) 

FQR  I :=LENGTH<FNAHE)TO  4  DO 
BEGINUFOR*) 

FNAME:=CONCAT!FNAHE,*  '); 

ENDUFOR*); 

ENDUIF*); 

URITELN; 

J:=0; 

FOR  I*«l  TO  NOFUZZYVARS  DO 
BEGIN!*FOR«) 

IF  FUZNAnECI]sFNAHE  THEN 
BEGIN ( *IF*> 

J«z  J  . 

END ( »IF* ) ; 

END ( *FOR* ) ; 

IF  J*0  THEN 
BEGIN!*IF»); 

URITELN! ' VARIABLE  NAHE  DOES  NOT  EXIST,'); 

END(#IF«) 

ELSE 

BEGINUELSE*) 

URITELN ( '  ENTER  THE  FOLLOUING  VALUES  FOR  ‘  ,FNAHE) ; 
URITELN; 

WRITE! *  INITIAL  VALUE  *  '); 

READLN(FUZVARCJI) ; 

WRITE! ' HAXLIHIT  *  '); 

READLNtHAXLIHITIJ]); 

WRITE! '  HINLIMT  =  '); 

READLN(IUNLIHlTtJI); 

END(»ELSE»); 


WRITELN; 

UR1TELN; 

END ( *  I F* ) ; 

UNTIL  (ANSWERO'YES'); 


REWRITE(OPTFUZZY) ; 

FOR  I s  =  1  TO  NQFUZZYVARS  DO 
BEG1NUFOR*) 

WRITELN (OPT FUZZY  ;FUZVAR[1] ,NAXLIHIT[I]  >t1INLIMITCI  3  fFUZNANECID ) 
ENDUFOR*); 

CLOSE(OPTFUZZY) ; 

ENDUWITH*); 

END ( *IF# ) 

ELSE 

BEGIN(*ELSE») 

WRITELN; 

WRITELN  ('INPUT  EXACT  SPELLING  OF  WORD  WHOSE  VARIABLES  '); 

WRITELN! ' ARE  TO  BE  CHANGED.'); 

WRITE  (‘WORD  =  '); 

READLN  (WORD); 

FOR  I s  =  (LENGTH(HORD) )  TO  (SPELLENGTH-1 )  DO 
BEGIN ( *FOR* ) 

WORD: :CONCAT < WORD  > '  '); 

END(*FOR*>; 

WRITELN  ; 

RESET  (WFILE); 

TEHPWORD:=' 

WNUNB : =0; 

WHILE  (NOT  EOF (WFILE) ) AND ( WORD < )TEHPWORD)  DO 
BEGIN  (*WHILE») 

SEEK( WFILE  ,WNUHB) ; 

GET(WFILE) ; 

WDATA:  =  WFILE* ; 

GET(WFILE) ; 

TEHPWORD:*WD AT A. SPELLING; 

WNUWB:= UNUHB+  i 
END  UWHILE*); 

WNUflB:  =  WNUflB*l ; 

IF  (WORDOTENPWORD)  THEN 
BEGIN  (*IF*) 

WRITELN; 

WRITELN  ('WORD  DOES  NOT  EXIST.  '); 

END  ( *IF* ) 

ELSE 

BEGIN  (»ELSE») 

RESET(FZOPT) ; 

SEEK(FZOPT, WNUNB); 

GET(FZOPT) ; 

FLIMIT 5  =  FZOPTfc ; 

WITH  FLIHIT  DO 
BEGIN(«WITH«) 
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REPEAT 

WRITELN; 

WRITELN* ' THE  CURRENT  FUZZY  VARIABLES  FOLLOW:  '); 

WRITELN; 

WRITELN*'  FUZZY  INITIAL  HAXLIH1T  H1NLIHIT'); 

WRITELN* ‘VARIABLE  VALUE’  )  ; 

WRITELN*' .  .  . '); 

WRITELN; 

FOR  1 5  =  1  TO  NOFUZZYVARS  DO 
BEGINUFOR*) 

WRITELN(FUZNAnECl]:8,FUZVAR[I]:l0:4,!1AXLiniTCI3:14:4,l1INLll11T[I]:14:4) 
END ( *FOR*  > ; 

WRITELN; 

WRITELN; 

WRITELN; 

WRITELN ( ' DO  YOU  WISH  TO  CHANGE  ANY  FUZZY  VARIABLES?  '); 
WRITELN* ‘ ENTER  “YES"  TO  CHANGE;  ELSE  ENTER  "NO".’); 

UR I TE ( 'ANSWER  =  '); 

READLN* ANSWER); 

WRITELN; 

IF  ANSWER*  'YES’  THEN 
BEG1N(*IF*> 

WRITELN* 'ENTER  THE  NAME  OF  THE  VARIABLE  TO  BE  CHANGED'); 

WR I TE ( ' VARIABLE  NAHE  =  '); 

READLN(FNAHE) ; 

IF  LENGTH(FNAHE) <5  THEN 
BEGIN(*IF*) 

FOR  1 ; =LENGTH (FNANE)TO  4  DO 
BEGINUFOR*) 

FNAHE:*CONCAT<FNAHE,'  '); 

END*«F0R«); 

ENDUIF*); 

WRITELN; 

J:*0; 

FOR  1**1  TO  NOFUZZYVARS  DO 
BEGINUFOR*) 

IF  FUZNAHECn  =  FNAHE  THEN 
BEGINUIF*) 

J:*l ; 

END**IF«); 

ENDUFOR*) ; 

IF  J=0  THEN 
BECIN**IF»); 

WRITELN ( ' VARIABLE  NAHE  DOES  NOT  EXIST.'); 

END ( *  IF* ) 

ELSE 

BEGIN ( *ELSE* ) 

WRITELN ( ' ENTER  THE  FOLLOWING  VALUES  FOR  ',FNAHE); 
WRITELN; 

WRI7E< ' INITIAL  VALUE  *  '); 

READLN (FUZVARtJD) ; 

WRITE ( 'HAXL1H1T  *  '); 

READLNinAXLIfllTtJI) ; 

WRITE ( ‘HINLIHIT  *  '); 
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READLN(HINLIHIT[J]) ; 
END ( *ELBE* ) ; 

WRITELNi 

WRITELN; 

END  t  *  I F  *  > ; 

UNTIL  < ANSHER<  > ' YES' ) ; 

SEEK  tFZOPT >MNUHB) $ 

FZOPT A i =FLIH1T ; 

PUT(FZOPT) ; 

END ( *W I TH* ) ; 

ENDUELSE*); 

ENDUELSE*); 


ENDUPROCEDURE  CHNGEOPT * ) ; 


{************###******#*#***#»#*#*****###»**#*#*##******#»*#**#*##* 


PROCEDURE  AUTODET  DESCRIPTION 


THIS  PROCEDURE  DETERHINES  A  PHONEHE  REPRESENTATION  FOR  A  GIVEN 
UORD  BASED  ON  THE  TRAINING  SPEECH  FILES  STORED  IN  FILE  SPECH.  THIS 
PROCEDURE  ESSENTIALLY  PROVIDES  THE  FOLLOWING  FOUR  OPERATIONS: 


OPERATION  DESCRIPTION 


AUTODET  -  DETERHINES  AN  INITIAL  PHONEHE  REPRESENTATION  BASED 

ON  THE  POSITION  AND  NUHBER  OF  TIHES  A  PHONEHE  OCCURS 
IN  THE  TRAINING  FILES. 

CORRECTR  -  USES  THE  STATISTICS  COLLECTED  FROH  THE  PREVIOUS 
REPRESENTATION  TO  IHPROVE  IT  BY  INSERTING  THE 
PHONEHES  WHOSE  INSERTION  AND  SUBSTITUTION  STATISTICS 
WHERE  HIGHER  THAN  THOSE  OF  THE  INITIAL  REPRESENTATION. 

PICXBEST  -  USES  THE  STATISTICS  COLLECTED  FROH  THE  PREVIOUS 

REPRESENTATION  TO  IHPROVE  IT  BY  REPLACING  THE  PHONEHES 
WHOSE  INSERTION  AND  SUBSTITION  STATISTICS  ARE  LOWER  THAN 
ANOTHER  PHONEHE'S. 

ADJUSTDEL  -  ALTERS  THE  CURRENT  PHONEHE  REPRESENTATION  BASED  ON  THE 
NUHBER  OF  TIHES  EACH  WORD  PHONEHE  WAS  DELETED.  THE 
DELETION  STATISTICS  WERE  COLLECTED  USING  THE  PREVIOUS 
PHONEHE  REPRESENTATION. 

PROCEDURE  CONSTANTS 


CONSTANT  DESCRIPTION 


THRESHHOLD  -  CONSTANT  USED  IN  PROCEDURE  FINDEND  TO  DETERHINE 
WHEN  A  WORD  PHONEHE  ENDS. 


HAIN  GLOBAL  PROCEDURE  VARIABLES 


VARIABLE  DESCRIPTION 


PHINT  -  INTEGER  INDICATING  THE  CURRENT  WORD  PHONEHE. 

DPHREP  *  ARRAY  USED  TO  STOkE  THE  WORD  PHONEHE  REPRESENTATION. 
DNOSPOKE  -  NUHBER  OF  SPEECH  FILES  IN  THE  WORD'S  TRAINING  SET. 
PHSCORE  •  ARRAY  (CONSISTING  OF  SCORES  FOR  EACH  PROTOTYPE  PHONEHE) 


USED  TO  DETERMINE  THE  NEXT  WORD  PHONEME. 

PHCNT  -  THE  NUMBER  OF  TIMES  A  PHONEME  IS  CONSECUTIVELY  CHOSEN 
AS  THE  NEXT  WORD  PHONEME. 


*»»»«»»*«** *»***«« «»«*«*« *«»*»*»»* ««»**«** ««*««**«****««******« *««»«*»«»»*«} 


SEGMENT  PROCEDURE  AUTODET( VAR  NOWORD: INTEGER;  NOPHVECTORS: INTEGER; 

VAR  CHNGEF LAG: BOOLE AN) ; 


CONST 

THRESHH0LD=0.30; 

TYPE 

PHARRAY-ARRAYC1.. NOPHONEMESI  OF  REAL; 
VAR 

PHINTtlNTEGER; 

DPHREP : ARRAY! 1 . .MAXPHONES]  OF  INTEGER; 

DNOSPOKE; INTEGER  5 

PHSCORE:PHARRAY; 

PHCNTsARRAYCl,. MAXPHONES]  OF  INTEGER; 
WORDDET ,WTEMP:STRINGCSPELLENGTH] ; 
J,K,I,INT,JINT:INTEGER; 
DETREC0RD,DETR2PCNT:INTEG2R; 

COUNT : I NTEGER ; 

TEMPsINTEGER; 

FLAG {BOOLEAN; 

INSsARRAYtO.. NOPHONEHES]  OF  REAL; 
NJEOFS:INTEGER; 

FINISHEDsBOOLEAN; 


{lIHIIHHHHOHHOOOHHHOHiHHHHIHHHHOHIimHH* 


PROCEDURE  FIXREP  DESCRIPTION 


SINCE  THE  WORD  RECOGNITION  ALGORITHM  EXAMINES  3  WORD  PHONEMES 
AT  A  TIME,  TO  DETERMINE  TRANSITIONS  BETWEEN  PHONEMES,  IT  IS  NECESSARY 
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(OR  EXTREEHELY  DESIRABLE)  THAT  AHY  COMBINATION  OF  THREE  ADJACENT 
PHONEMES  IN  THE  WORD  REPRESENTATION  BE  UNIQUE.  THIS  PROCEDURE 
INSURES  THIS  UNIQUENESS  BY  ELIMINATING  ALL  PHONEMES  THAT  ARE  A 
DUPLICATE  OF  ONE  OF  THE  PRECEDING  TWO  WORD  PHONEMES. 


a**#########**###**##*#*#*#***#**#####****#*##**#########*##########*} 


PROCEDURE  FIXREP; 
VAR 

I , J : I  NT  EGER  ; 
CFLAG:BOOLEAN; 

BEGIN(*FIXREP«) 


CFLAG:=FALSE; 

WITH  WDATA  DO 
BEG1NUW1TH*) 

IF  PHINT>2  THEN 
BECINUIF*) 


{Determine  if  any  tao  consecutive  aord 
phoneaes  are  duplicated,  and  eliainate  the 
second  duplicate.} 


FOR  I  a  =  3  TO  PHINT  DO 
BEGIN(«FOR«) 

IF  PHREPLI-2]*PHREPCn  THEN 
BECIN(*IF») 

FOR  J: si  TO  (PHINT-1)  DO 
BEGIN(«F0R») 

PHREP[J3:*PHREPt J+13; 

END(*FOR«) ; 

PHREPCPHINT3  a  =0; 

PHINT::PH1NT-1; 

CFLAG:sTRUE; 

END ( » I F» ) ; 

END(»FOR«>; 

{Deteraine  if  the  first  and  third  aord  phoneaes  of 
any  three  consecutive  phoneaes  are  duplicates, 
and  eliainate  the  second  duplicate.) 

FOR  I:*2  TO  PHINT  D" 

BEGIN ( *FOR* ) 

IF  PHREPn-13  =  PHRtr'CI3  THEN 
8EGIN(»IF») 

FOR  J:*I  TO  (PHINT)  DO 
BEGIN(«FOR«) 
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<r 


PHREPt J-l] :sPHREPCJl; 

END ( »FOR* ) ; 

PHREPtPHINTl s  =  0 ; 

PH1NT::PHINT-1; 

CFLAGs^TRUE; 

END  <  *  I F*  > ; 

END(*F0R*>; 

ENDUIF*); 

END ( *W I TH*  > ; 

IF  CFLAGSTRUE  THEN 
BEGINUIF*) 

{Reevaluate  representation  <if  any  changes  were  wade) 
to  elininate  any  duplicates  that  could  have  been 
created.) 

FIXREP; 

END ( *IF* ) ; 

END( ^PROCEDURE  FIXREP*); 


{e***************************************************************** 


PROCEDURE  MAXCHOICE  DESCRIPTION 


THIS  PROCEDURE  ASSIGNS  THE  NEXT  WORD  PHONEME  FOR  THE 
INITIAL  REPRESENTATION  BY  CHOOSING  THE  PHONEME  WITH  THE  HIGHEST 
SCORE  IN  THE  ARRAY  PHSCORE. 


**********  ****************** ******** *********************************} 


PROCEDURE  HAXCHOICE ; 


HAXCH:REAL; 

J :  INTEGER ; 

BEGIN (*HAXCHOICE*> 


(Determine  phonene  with  highest  score.} 
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HAXCH::0; 

FOR  Js=  1  TO  NOPHONEI1ES  DO 
BECINt  *FOR«) 

IF  PHSCQRECJ]>MAXCH  THEN 
BEGIN  < » I F  » ) 

WDATA . PHREPCPHINTD ;= J ; 

HAXCH:£  PHSCORECJ3*, 

END { * I F* ) ; 

PHSCORECJ3isO; 

END(»FOR»); 

#RITELN( ' PHONEME  CHOSEN5  ' ,UDATA.PHREP[PHINT]  :2, '  PHINT=  '  ,PHINT ;2 , 
1  SCORE=  ‘ ,HAXCH); 

(Update  word  phoneie  representation.} 

IF  PHINTOl  THEN 
BEGIN ( » IF* ) 

IF  WDATA ,PHREPCPHINT3=WDATA. PHREPCPHINT-13  THEN 
BEGINUIF*) 

WDATA.PHREPtPHINT]s=0; 

PHINT:  =  PHINT-1 ; 

PHCNTtPHINT] :sPHCNTCPHlNT3*l ; 

END ( » IF* ) ; 

ENDUIF*); 

PHINTs  =  PHINT+l ; 

END(*PROCEDURE  HAXCHOICE*)? 


{»***»**»******#**###«#****#**«#«««*##***»#******#*#«*»*#«***#****##*# 


PROCEDURE  SUMXVECTORS  DESCRIPTION 


THIS  PROCEDURE  SUMS  THE  SCORES  OF  EACH  PHONEME  OCCURRING 
IN  THE  NEXT  "NOPHVECTGRS"  VECTORS  OF  THE  CURRENT  SPEECH  FILE. 
IT  SHOULD  BE  NOTED  THAT  THIS  SUM  IS  ALSO  SUMMED  WITH  THE 
PREVIOUS  SUMS  OBTAINED  FOR  THE  PREVIOUS  SPEECH  FILES,  THIS 
IS  DONE  TO  DETERMINE  A  WEIGHT  FOR  EACH  PHONEME  THAT  IS  USED 
BY  PROCEDURE  HAXCHOICE  TO  CHOOSE  THE  NEXT  WORD  PHONEME. 


»»»**•»»«*«•««««»****»««**»»*»*«*»*«•*»»*<*«****«*********»*****»*****} 


PROCEDURE  SUMXVECTORS( VAR  PHSCRE-.PHARRAY) ; 


VAR 

I,  Js  INTEGER  ,* 

vscore^ argument : REAL ; 

BEG IN ( *SUMX VECTORS* ) 

DETREPCHT :  =  1 ; 

Unitialize  array  used  to  Keep  track  of 
phoneae  scores*} 

FOR  I:=l  TO  NOPHONEMES  DO 
BEGIN  (*F0R») 

INSCI3::0*0; 

END ( *FOR* ) ; 

FLAGs=TRUE; 

WITH  IOUT  DO 
BEGIN( ttUITH*) 

WHILE  ( FL AG  =  TRUE )  DO 
BEGIN(*WHILE») 

(Update  score  for  each  phoneae  occurring  in 
next  “NOPHVECTORS"  vectors.} 


FOR  I jsl  TO  NOGUESSES  DO 
BEGIN(*FOR*) 

WITH  IOUT  DO 
BEGINUWITH*) 

TEMP:=ICHPHONECI,K3? 

ARGUMENT ::ISFACTQR(K3/SFE; 

VSCORE:=<l-ICH VALUE Cl ?K3/100)/CHVE; 
VSCORE:M1-POUER<VSCORE,CHVF>>*(1  -POWER ( ARGUMENT, SFF) ) 
INSCTEMP3 :sVSC0RE+INSCTEHP3  $ 

END( »«ITH») ; 

END < *F0R» ) ; 

K:*K+l; 

DETREPCNT:=DETREPCNT+1 ; 

IF  ( (K)RCRDLENGTH)  AND  t CONTFL AG  =  1 > )  THEN 
BEGIN ( *IF* ) 

GET(SPECH); 

IOUT  s  =  SPECHA ; 

Ks  =  l; 

END  <  * I F* ) 

ELSE  IF  (X)RCRDLENGTH) AND(C0NTFLAG:0)THEN 
BEGINUELSE*) 

X  t  - 1 ; 

FLAG :  =  FALSE ; 

END ( *ELSE* ) ; 

IF  (DETREPCNT)NOPHVECTORS)  THEN 
BEC1N<«IF») 

DETREPCNTs  =  1 ; 

FLAG  < s  FALSE ; 


{Update  variable  that  Keeps  track  of  each 
phoneme's  score  for  each  of  the  word's  training 
files  for  the  word  phoneme  being  determined. } 


FOR  Js=l  TO  NQPHONEHES  DO 
BEGIN ( *F0R* ) 

IF  INSCJDO.O  THEN 
BEGINt  *IF« ) 

PHSCRE  E  J  3 ; =  PHSCREE J3  + 

INSIJ3; 

END ( *IF* ) ; 

ENDUFOR*); 

END  <  *  I F* ) ; 

END ( *WH I LE* ) ; 

IF  DETREPCNT>1  THEN 
BEGINUIF*) 

FOR  J  t  =  l  TO  NQPHONEHES  DO 
BEGIN ( *F0R*  > 

IF  INSE J 3)0*0  THEN 
BEGIN(«IF«) 

PHSCRECJ]  :sPHSCRECJ3+ 

INSCJ3 ; 

END ( »IF») ; 

END ( *F0R* ) \ 

END < *IF* ) ; 

END ( * WI TH* ) ; 


END( ^PROCEDURE  SUMXVECTORS*) ; 


PROCEDURE  FINDEND  DESCRIPTION 


THIS  PROCEDURE  ATTEMPTS  TO  FIND  THE  STARTING  POINT  OF  THE 
NEXT  WORD  PHONEME  IN  EACH  OF  THE  WORD'S  TRAINING  FILES  BY  USING 
AN  ALGORITHM  THAT  iS  SIMILAR  TO  THE  ACTUAL  RECOGNITION  ALGORITHM, 
THE  BAIN  DIFFERENCE  BETWEEN  THE  ALGORITHM  USED  IN  THIS  PROCEDURE , 

IS  THAT  A  THRESHOLD  MUST  BE  USED  IN  DETERMINING  WHEN  A  WORD  PHONEME 
ENDS,  SINCE  THE  NEXT  WORD  PHONEME  IS  UNKNOWN.  WHEN  THE  STARTING 
VECTOR  OF  THE  NEXT  PHONEME  IN  A  SPEECH  FILE  IS  DETERMINED, 

PROCEDURE  SUMXVECTORS  IS  CALLED.  AFTER  ALL  THE  SPEECH  FILES  HAVE 
UNDERGONE  THIS  PROCESS,  PROCEDURE  MAXCHOICE  IS  CALLED. 

THIS  PROCESS  IS  REPEATED  UNTIL  THE  WORD  PHONEME  REPRESENTATION 
ACCOUNTS  FOR  EVERY  VECTOR  IN  EVERY  TRAINING  FILE.  IT  SHOULD  BE 
EMPHASIZED  THAT  THIS  PROCEDURE  IS  USED  ONLY  TO  ARRIVE  AT  AN 
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INITIAL  WORD  REPRESENTATION. 


»«»*«»»«** «««***«* »«**»** *x***««« »******»*«»««*»**»««**««* *«****»***«} 


PROCEDURE  FINDEND; 

VAR 

PRPHONE:REAL; 

I r J :  INTEGER ; 

ENDOFF1LE: BOOLE AN; 

VSCORE , ARGUDENT :REAL ; 

PHINS:ARRAYCO< . NQPHONEHES D  OF  REAL; 
NAX:REAL; 

PHTEMP: INTEGER ; 

BEGINUFINDEND*) 


RESET(SPECH) ; 

N0E0FS:*0; 

DETRECORD:=0; 

WHILE  NOT  EOF(SPECH)  DO 
BEGINUWHILE*) 

SEEX(SPECH,DETRECORD) ; 

CET(SPECH); 

IOUT:sSPECH* ; 

IF  I  OUT • SPELL -UORDDET  THEN 
BEGIN  <  *IF* ) 

FOR  Js= 1  TO  NOPHONEMES  DO 
BEGIN ( *FOR* ) 

PHINSCJ] ::0.0; 

ENDUFOR*); 

BEGINREC:=DETRECORD; 

INT:=0; 

Ks  =  l ; 

REPEAT 

FINISHED: =FALSE; 

ENDOFFILE:=TRUE; 

I NT i=INT+l; 

X:s (PHCNTCINT]»3)fX; 

IF  (X)RCRDLENGTH)  AND  < 1  OUT . CONTFL AG  =  1 )  THEN 
BEGIN  I *1F  *  > 

GET(SPECH) ; 

IOUT:=SPECH»; 

DETRECORD:5DETRECORD+l ; 

K:*X-RCRDLENGTH; 

END ( «IF* ) 

ELSE  IF  (X)RCRDLENGTH) AND  (IOUT .CONTFLAG-OITHEN 
BEGIN ( «ELSE» ) 

X:*40; 


ENDUEISE*); 

(Determine  where  the  last  word  phoneue  ends  in 
the  speech  fie.) 

REPEAT 

PRPHONE:sO; 

FOR  J:=  1  TO  NOGUESSES  DO 
BEGIN ( *FOR* ) 

IF  (WDATA.PHREPCINT]=I0UT.ICHPH0NECJ,K3)  THEN 
8EGIN(#IF«) 

WITH  IOUT  DO 
8EGIN(#UITH») 

PHTEHP  s  =  ICHPHONEC J ,K3 ; 

ARGUMENT t=ISFACTQRCK3/SFE; 
USC0RE;=(1-1CHVALUE[J,K]/100)/CHUE; 
VSCORE:  =  ( 1 -POWER! VSCORE  ,CHVF)  )#( l* 

POWER (ARGUMENT  ,SFF) ) ; 
PRPHQNE:=VSCORE; 

END(*WITH*)? 

END ( *IF« ) ; 

END ( «FOR* ) ; 


{If  the  current  word  phoneue  did  not  occur 
in  the  choices  for  the  current  vector,  then  test 
to  determine  if  it  occurs  in  the  next  NOPHVECTORS 
vectors.) 

IF  PRPH0NE=0  THEN 
BEGIN ( »IF* ) 

C0UNT;=0; 

FOR  1 5  =  1  TO  NOPHVECTORS  DO 
BEGIN(«F0R«) 

Ks=X*l v 

IF  ( X)RCRDLENGTH) AND ( IOUT. C0NTFLAGS1 )  THEN 
BEGINUIF*) 

GET  <  SPECH ) ; 

IOUT!*SPECH*? 

DETREC0RD:  =  DETREC0RD  +  1 ; 

Xssl; 

END(»IF») 

ELSE  IF  < K>RCKDLENCTH> AND( I0UT.C0NTFLAG  =  0 >  THEN 
BEGIN ( «ELSE* ) 

IF  C0UNT=0  THEN 
COUNT 
XiMl? 

FINISHED:  =TRUE; 

ENDUELSE*); 

IF  FINISHED=FALSE  THEN 
8EGIN(«IF») 

COUNT:sCOUNT+l; 

FOR  J:=l  TO  NOGUESSES  DO 
BEGIN ( *F0R*  > 

WITH  IOUT  DO 
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BECIN( *WITH* ) 

PHTE«Pi=ICHPH0NECJ,K3; 

ARGUMENT  s  =  ISFACTORLKD/SFE? 

VSCORE:= (l-ICH VALUE [J  >K]/100)/CHVE? 
VSCORE : - ( 1 -POWER! V SCO RE, CHVF) )#(1- 
POWER(ARGUI1ENT,SFF)); 
PHINS£PHTEHP3:=VSCORE+PHINS[PHTEMP3 ; 
END(*WITH») ; 

END(«FOR«>‘, 

ENDUIF*); 

END(*FOR#) ; 

IF  (PHINSC WDATA.PHREPE 1NTD3/COUHT XTHRESHHOLD  THEN 
BEGINUIF*) 

FIN1 SHED  5  =  TRUE ; 

END ( *  IF*  > ; 

HAX:=0; 

FOR  J  :  =  1  TO  NOPHONEHES  DO 
BEGIN ( «FOR» ) 

IF  PHINSCJ33HAX  THEN 
BEGINUIF*) 

PHTEMfs-J; 

HAX:  =  PHINSC J] ; 

ENDUIF*)*, 

PHINSC J3  s  =  0; 

END ( *FOR* ) ; 

IF  < K-COUNT  <  0)  THEN 
BEGIN(*IF«) 

SEEK(SPECH , (DETRECORD-1 ) ) ; 

GET(SPECH) ; 

IOUT:*-SPECH*? 

Xi SK-CQUNT+RCRDLENGTH; 

END  <  *  I F* ) 

ELSE 

BEGINUELSE*) 

X;sX-COUNT; 

END(*ELSE«) ; 

END<«IF*>; 

Ks  =  K+l ; 

IF  <K>RCRDLENGTH) AND  ( I  OUT . CONTFLAG=  l )  THEN 
BEGINOtlF*) 

GET ( SPECH) ; 

IOUT  s  =  SPECH* ; 

X:sl  * 

ENDU1F*) 

ELSE  IF  < X >RCRDLENGTH) ANDdOUT . CONTFLAG  =  0 ) THEN 
BEG  I N ( *  ELSE  * ) 

ENDOFF I LEs=FALSE; 

FINISHEDs-TRUE; 

END <  *ELSE« ) ; 

UNTIL  <FINISHED  =  TRUE) ; 

UNTIL  (IMTMPH1NT-1)); 

IF  ENDOFFILE=TRUE  THEN 
BEGINUIF* ) 

SUHXVECTORS(PHSCORE) ; 
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ELSE 

BEG1NUELSE*) 

NQEQFSi=NQEOFS+l; 

END <  *ELSE* ) ; 
DETRECORDs=BEGINREC; 
SEEK(SPECH;DETRECORD); 
REPEAT 

GET(SPECH)', 

10UT:sSPECHA ; 

DETRECORDs  =  DETREC0RC  +  1 ; 
UNTIL  (IOUT ,C0NTFLAG=0) ; 
GET(SPECH)  5 
ENDUIF*) 

ELSE 

BEGINUELSE*) 

DETRECORD:=DETRECORD+l ; 
GET(SPECH) ; 

END ( *ELSE*  > ; 

ENDUUHILE*) ; 

WRITELN ( 'NO.  EOFS*  ',NOEOFS); 

IF  NOEOFS<DNOSPOXE  THEN 
BEGIN ( » I F  * ) 

MAXCHQICE; 

IF  PHINT<  =NAXPHONES  THEN 
BEGINUIF*) 

FINDEND? 

END  C  *1 F  » ) 

ELSE 

BEGINUELSE*) 

PHINT :  =  PHINT-1 ; 
ENDUELSE*)*, 

END(*IF*) ; 

END( *PROCEDURE  FII&END*); 


I************************************************************* *»***» 


PROCEDURE  CORRECTREP  DESCRIPTION 


THIS  PROCEDURE  ADDS  PHONEMES  TO  THE  CURRENT  WORD  REPRESENTATION 
ON  THE  BASIS  OF  THE  STATISTICS  COLLECTED  USING  THE  PREVIOUS  WORD 
REPRESENTATION,  THE  PHONEMES  ARE  INSERTED  BASED  ON  THE  SUM  OF  THE 


INSERTION  AND  SUBSTITIQN  SCORES  FOR  EACH  WORD  PHONEME ♦ 


PROCEDURE  CORRECTREP( NOWORD: INTEGER;  VAR  CHNGEFLAG: BOOLEAN); 
VAR 

CF ACTOR, I MAX, I ,J:INTEGER; 

SUBINS:ARRAY[0, .MAXPHPLUS1 , 1 . .NOPHONEMES]  OP  REAL; 
NAXsREAL; 

DPHREP:ARRAY[1 . .MAX PHONES]  OF  INTEGER; 


BEG1NUCORRECTREP*) 

URITELN ( ' CORRECTREP' ) ; 

SEEKISFILE, NOWORD); 

GET(SFILE) ; 

USTATS:  =  SFILEA ; 

SEEK < UF I LE , NOWORD) ; 

GET(WFILE) ; 

WDATA:=WFILEA ; 

{Save  initial  word  representation.) 

FOR  I s  =  1  TO  MAXPHONES  DO 
BEGINUFOR*) 

DPHREPtI3:=WDATA.PHREPCI]; 

ENDUFOR*); 

{Initialze  the  variables  used  to  indicate 
each  phoneme's  substition  and  insertion 
statistics  for  each  word  phoneae.) 

FOR  1**1  TO  NOPHQNEMES  DO 
BEGINUFOR*) 

SUBINStO,I3 :;0.0; 

SUBINS[<MAXPH0NES+1),I3:=0.0; 

ENDUFOR*) ; 

FOR  J;s 1  TO  MAXPHONES  DO 
8EGINUF0R*) 

FOR  I s= 1  TO  NOPHONEMES  DO 
BEGIN ( *FOR* ) 

SUBlNStJ ,I3;sWSTATS.XSUBYC J ,13+WSTATS.XINAFTZt J *13; 
END ( « FOR* ) ; 

ENDUFOR*); 

F'HINT :  =  1 ; 

CFACT0R:=0; 
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WITH  WDATA  DO 
BEGIN ( *y ITH* ) 

WHILE  (PHREPCPHINT3O0)  AND  <PH1NT<  FAXPHONES)  DO 
BEGIN(«UHILE«) 


{Test  to  see  if  statistics  indicate  that  the 
current  mord  phoneme  is  the  best*) 


N  AX ;  =  0  5 
IHAX:=0; 

FOR  I :=1  TO  N0PH0NEHES  DO 
BEGIN  <  *FOR»  > 

IF  (SUBINSCPHlNT,l])HAX)OR(<inAX-PHREPCPHINT])AND 
(SUBINSCPHINT^H-HAX))  THEN 
B  EG 1 N  < « I F* ) 

H AX :  =  SUB  I NS [PHI NT  ,  I) ; 

IHAX::I; 

END(*1F») ; 

END  <  *F0R* ) ; 


{If  the  current  word  phoneme  is  not  the  best, 
add  another  phoneme  to  the  representation.) 

IF  (IHAXOPHREP[PHINT+CFACTOR])AND<IHAXOO)  THEN 
BEGIN  <  «IF* ) 

CHNGEFLAG::TRUE; 

IF  PHREPCHAXPHONESK  >0  THEN 
BEG I N ( *  I F* ) 

FOR  J:=l  TO  HAXPHONES  DO 
BECINOFOR* ) 

DPHREPCJ)  :sPHREPCJ) ; 

END(«F0R*); 

{Determine  if  the  nem  phoneme  should  be  added  before 
or  after  the  current  mord  phoneme  based  on  the 
statistics  for  the  adjacent  mord  phonemes;  and  make 
the  appropriate  revision.) 

IF  (SUBINS[(PHINT-1),1HAX)>  =  SUBINS[(PHINT  +  1)  ,IHAX])AND 
( SUB  INS t (PH I  NT- 1) ,IHAX3>SUBINS[(PHINT-1), 
(PHREPCPHINT+CFACTOR] ) ] )  THEN 
BEGINt  *IF«) 

FOR  Js=(PHlNT+CFACTOR)  TO  (HAXPH0NES-1 )  DO 
BEGIN(*F0R*> 

PHREPC J+l) :  sDPHREP[J] ; 

END ( *F0R* ) ; 

PHREPCPHINT+CFACTOR) js INAX; 

ENDUIF*) 

ELSE 

8EGIN(*ELSE») 

FOR  J:* (CFACTOR+PHINT  +  1 )  TO  (HAXPHONES-1 )  DO 
BEGINUFOR*) 

PHREPCJ+1];-DPHREP[J]; 

END(»F0R«) ; 
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PHREP [ PH INT+l+C FACTOR] : sIHAX; 

END(«ELSE*>; 

CFACTORs=CFACTOR+l; 

END( kIF*) 

ELSE 

BEG1N<*ELSE*) 

{If  the  aaxiaua  nuaber  of  aord  phoneaes  has  been 
reached;  sinply  replace  the  current  word  phonene 
with  the  one  indicated  by  the  statistics.) 

PHREPtPHINT+CFACTOR] :J1NAX; 

END(*ELSE»); 

END(»IF»); 

PHINT :  =  PHINT  +  1 ; 

END (*UHILE») ; 

ENDUUITH»); 

FIXREP? 

SEEK (WFILE  ,NQUQRD) ; 

WFILE* : =WDATA ; 

PUT (WFILE) ; 


END(*PROCEDURE  CORRECTREP#) ; 


{#**«*# »*******#**#*#»»«*****« »*#«**»**»*»**«****«***«*«***«* *»»*** 


PROCEDURE  PICKBEST  DESCRIPTION 


THIS  PROCEDURE  USES  THE  SUN  OF  EACH  WORD  PHONENE' S  INSERTION 
AND  SUBSTITUTION  STATISTICS,  TO  REPLACE  ALL  WORD  PHONEMES  WITH 
THE  PHONENE  WHOSE  SUMMATION  IS  HIGHEST. 


IHHHOiOHHIlHHHHHIOHHmomiiltHHIHHIHHtOHHIOl} 


PROCEDURE  PICKBEST! NOWORD: INTEGER;  VAR  CHNGEFLAG : BOOLEAN ) ; 


in 


I  MAX , I , J : INTEGER ; 

NAX:REAL; 

SUBINS:ARRAYCO.  .flAXF’HPLUSl  ,1 « . NOPHONEflES 3  OF  REAL; 


BEGIN(*PICKBEST») 

WRI TELN ( ' PICKBEST' ) ; 

SEEK(SFILE,NOWORD>; 

GET(SFILE) ; 

WSTATS:=SF1LE* ; 

SEEK<UFILE, NOWORD); 

GET(WFILE) ; 

WDATA: :UFILE* ; 

Unitialize  variable  array  that  indicates 
each  phoneiies  substitution  and  insertion 
plausibilities  for  each  word  phoneweiJ 

FOR  I:«l  TO  NOPHONENES  DO 
8EGIN(*F0R«) 

SUBINSCO/I] ::0.0; 

SUBINSC (NAXPHONES+1 )  ,I]:=0.0; 

END(*FOR») ; 

FOR  J:  =  l  TO  NAXPHGNES  DO 
BEGIN(*FOR») 

FOR  1**1  TO  NOPHONENES  DO 
BEGIN ( »FOR« ) 

SUBINSCJ,I]:=WSTATS.XSU8YCJ,n+WSTATS,XINAFTZCJ,I]; 

END(»FOR»); 

END(*F0R») ; 

PHINT:*1; 

WITH  WDATA  DO 
BEGIN(*WITH»> 


{Test  to  see  if  the  statistics  indicate 
that  the  current  word  phonewe  is  the  best.J 

WHILE  (PHREPCPHINT3<  >0 ) AND(PH1NT<  sHAXPHONES) DO 
BEGIN! ftWHILE*) 

MAX: =0; 
l«AX:=0; 

FOR  1**1  TO  NOPHONENES  DO 
BEGIN ( »FOR« ) 

IF  ( SUBINS [PHI  NT >I])NAX)OR< (INAXsPHREP[PHINT3)AND 
( SUB  I NS t PH I NT  ;I3  =  NAX))  THEN 
BEG I N ( *  I F  » ) 

NAX: 5  SUB  I NS CPH I  NT  »I J? 

INAXs-I ; 

END ( # I F  * ) ; 
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1 


END ( *FOR« ) ; 

{If  current  »ord  phoneae  is  not  the  best, 
then  replace  it  with  the  one  the  statistics 
suggest  is  better.) 


IF  ( IMAX< )PHREPCPHINT)) AND( IHAX<  >0)  THEN 
6ECI N  <  *  I F* ) 

CHNGEFLAG:sTRUE; 

PHREPCPHINTD :  =  IHAX ; 

ENDUIF*); 

PRINT s  =  PHINT  +  l ; 

ENDUWHILE*) ; 

END ( *UI TH* ) ; 

FIXREP; 

SEEK(WFILE,N0W0RD) ; 

UFILEA:=UDATA; 

PUT <  WFI LE ) ; 


ENDUPROCEDURE  PICKBESTM; 


{a####**#*#####*****#*#*######*###**#*#*****##*********#*****##***# 


PROCEDURE  ADJUSTDEL  DESCRIPTION 

THIS  PROCEDURE  DELETES  WORD  PHONEMES  BASED  ON  THE  NUMBER 
OF  TIMES  EACH  PHONEME  WAS  DELETED,  THE  DELETION  STATISTICS  WERE 
COLLECTED  USING  THE  CURRENT  REPRESENTATION, 

»«####»**#****###***##»########**#***#*###»«*#*****»#*****##»*»*###*##} 

PROCEDURE  ADJUSTDEL (NOWORD: INTEGER;  VAR  CHNGEFLAGsBOOLEAN) ; 

VAR 

CFACT0R,I,J:1NTEGER; 
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BEGINUADJUSTDEL*) 

URITELNi ' ADJUSTDEL' ) ; 

SEEK(SFILE,NOWORD>; 

CET(SFILE) ; 

USTATS:*SFILE* ; 

SEEKIHFILE, NOWORD)? 

GET ( UP  1 LE ) ; 

UDATA  s  =  WFILEA ; 

{Save  current  #ord  representation.) 

FOR  I : = 1  TO  MAXPHONES  DO 
BEGIN ( *F0R* ) 

DPHREPCI] s  =  UDATA»PHREPCII ; 

ENDUFOR#); 

CFACT0Ri=0; 

FOR  I : =1  TO  HAXPHONES  DO 
8ECINUF0R#) 

{Delete  all  current  eord  phoneees  that  eeet 
the  fol I o»ing  test.) 

IF  USTATS.NDELm>R0UND(USTATS.N0$P0XE/2)  THEN 
BEGINUIF#) 

CHHGEFLAG ;  =  TRUE ; 

FOR  J:M  I-CFACTOR)  TO  (HAXPHONES-i)  DO 
8ECINUF0R#) 

DPHREPCJ]  :  =  DPHREPCJ+13 ; 

ENDUFOR#) ; 

DPHREPLHAXPHONES):=0; 

CFACTOR:=CFACTOR+l5 

ENDUIF#); 

ENDUFOR#); 

FOR  I 1  TO  HAXPHONES  DO 
BEGINUFOR#) 

UDATA. PHREP[I]:  =  DPHREPU]; 

ENDt»FOR»); 

PHINT  a  =  0; 

REPEAT 

PHINTssPHINT+l ; 

UNTIL  (UDATA.PHREPCPHINTD=0)OR<PHINT=HAXPHONES) a 
IF  UDATA. PHREPCPHINTDsO  THEN 
PHINT :  =  PH  I  NT  - 1 ; 

FIXREP; 

SEEK(UFILE,NOUORD) ; 

WFILE*  a  =  WDATA ; 

PUT(UFILE) ; 


ENDUPROCEDURE  ADJUSTDEL#) ; 
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BEG1NUAUT0DET*) 


IF  OPERATION^ ' AUTODET'  THEN 
BEG 1 N ( * I F*  > 

WRITELNt ' AUTODET' ) ; 

SEEK ( UF I L E  ,NOUORD) ; 

GET(WFILE)? 

WDATA:  =  UFILE*  ? 

WORDDET;=WDATA. SPELLING? 

{Initialize  the  word  representation  to  zero's*) 

FOR  I s=l  TO  HAXPHONES  BO 
BEGINUFORO 

WBATA.PHREPtI]:=0? 

PHCNTC1) s=l ? 

ENDUFOR*)? 

DN0SP0KE:=0? 

FOR  J:=  1  TO  NOPHONEHES  BO 
BEGIN  (»F0R*) 

PHSC0RECJ]:=0.0? 

ENB  UFOR*)? 

RESET (SPECH) ; 

DETREC0RB:=0? 

{Deteriine  nu»ber  of  training  files  stored  for  the 
■ord,  and  determine  the  first  word  phoneae  in  the 
■ord  representation.) 

WHILE  NOT  EOF(SPECH)  BO 
BEGINUWHILE*) 

SEEK (SPECH ,DETREC0RD)? 

GET(SPECH) ; 

IOUT:=SPECH* ? 

IF  IDUT.SPELL=WQRDDET  THEN 
BEGIN ( *IF« ) 

DNOSPOKE i =  DNOSPOKE  + 1 ; 

WRITELNt ’ NUMBER  OF  SPEECHFILES  FOR  WORD  =  DNOSPOKE) 
BEGINREC:=DETRECORD? 

PHINTs*l? 

K:  =  l  ? 

SUHXVECTORS(PHSCORE) ? 

BETREC0RD:  =  BEG1NREC  \ 

SEEK( SPECH, BETRECORB) ? 

REPEAT 

GET(SPECH) ; 

IOUT:*SPECH*  ? 

DETREC0RD:=DETREC0RD+1? 

UNTIL  ( I0UT«C0NTFLAG  =  0) ; 

GET ( SPECH ) ; 

ENB(*IF«) 

ELSE 

BEGINUELSE*) 

BETREC0RB:  =  BETREC0RD  +  1 ? 


GET(SPECH) ; 
END(»ELSE«); 
END  UWHILE*)? 


URITELN? 

URITELH; 

HAXCHOICE; 

F1NDEND; 

FIXREP; 

SEEK < UF I LE ,NOWORD) ; 

WFILE* : 5WDATA ; 

PUT  (I4F1LE) ; 

END ( *1 F* ) 

ELSE  IF  OPERATION5 ' CORRECTR'  THEN 
BEGIN(«ELSE«) 

CORRECTREP(NOWORD,CHNGEFLAG)? 

END(#ELSE«) 

ELSE  IF  OPERATION5 ' PICKBEST'  THEN 
BEGINUELSE*) 

PICKBEST (NOUORD ,CHNGE FLAG) ; 

ENDOELSE*) 

ELSE  IF  OPERATION5 ' ADJUSTDE'  THEN 
BEGIN(«ELSE*> 

ADJUSTDEU NOWORD  >CHNGEFL AG) ; 

END (*ELSE*) 

ELSE 

BEGIN(»ELSE») 

WRITELNt ' ERROR;  NO  OPERATION  SPECIFIED.') 
ENDUELSE*); 

FOR  I s  =  1  TO  HAXPHONES  DO 
BEGINUFOR*) 

WRITE (STATSOUT ,WDATA.PHREPC13:3) ; 
END(«FOR«); 

WR1TELN< STATSOUT ,WDATA. SPELLING)  5 
END  ^PROCEDURE  AUTODET#); 
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{•a#*##*###**#****#########*#**########*####**####*###****#######** 


PROCEDURE  STOREWORD  DESCRIPTION 


THIS  PROCEDURE  ALLOWS  A  USER  TO  STORE  A  NUMBER  OF  SPEECH 
FILES  1H  FILE  SPECH  OR  FILE  RECOC. 


#«**#**#*##»*#*#*##**#*##**#*######******#**###»»*«##*##*##*#*#***#} 


SEGMENT  PROCEDURE  STORESPEECH; 
VAR 

NOWORD, I ; INTEGER; 

ANS,  ANSWER: STRING C33; 
TEHPWORD:STRING[SPELLENGTH] ; 
FLAG:BOOLEAN; 


BEGIN (*STORESPEECH«) 

INITFILE; 

ENDFLACs-FALSE; 

WRITELN; 

WRITELN!' ENTER  "YES"  TO  STORE  SPEECH  FILES  FOR'); 
WRITELN( ’INITIALIZING  WORD  STATISTICS;  ELSE  ENTER  '); 
URITELN < ' "NO”  TO  STORE  SPEECH  FILES  FOR  FUTURE');; 
WRITELN! ‘ RECOGNITION  SCORING.' ); 

WRITECANSWER  -  '); 

READLN( ANSWER) ; 

WRITELN; 

IF  ANSWER1' YES’  THEN 
BEGIN(*IF») 

REPEAT 
WINIT1 ; 

IF  ENDFLAGsFALSE  THEN 
BEGIN(»IF«) 

STOREWORD; 

END! *IF») ; 

WRITELN; 

WRITELN! 'ENTER  "YES"  IF  YOU  WOULD  LIKE  TO  ADD'); 
WRITELN! ' ANOTHER  SPEECH  FILE;  ELSE  ENTER  "NO".'); 
WRITECANSWER  =  '); 

READLN(ANS) ; 

WRITELN; 

WRITELN; 

UNTIL  ANSO'YES'; 
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ENDUIF*) 

ELSE 

BEGINUELSE#) 

FILENAME :=CONCAT( SPEAKER, 'RECOG'); 

F I  LET I TLE(SPECH, FILENAME) ; 

REPEAT 

RESET(SPECH) ; 

URITELN; 

URITELN ( ' INPUT  NAME  OF  SPEECHF1LE' ) ; 
URITE( ' FILENAME  =  '  ); 

READLN (FILENAME ) ; 

URITELN; 

FILETITLE(0UT2, FILENAME); 

RESET  (0UT2) ; 

UITH  IOUT  DO 
BEGIN  UUITH«) 


(Detemine  if  eord  given  in  speech  file 
is  in  the  current  vocabulary.  If  it  is 
not  then  alio*  user  to  change  the  mord's 
spell ing.} 

READLN (OUT 2, SPELL) ; 

FOR  Ii *  (LENGTH(SPELL) )  TO  (SPELLENGTH-1)  DO 
BEGIN  ( »FQR« ) 

SPELL :-C0NC AT ( SPELL, '  '); 

END  (*FOR*) ; 

REPEAT 

RESET  (UFIL.E) ; 

FLAG:-FALSE; 

TEMPW0RD:= '  '; 

NQUORDi=0; 

UHILE  (NOT  EOF(UFILE)  )AND(SPELL< > TEMP WORD)  DO 
BEGIN  («UHILE*) 

SEEK ( UF1 LE , NOWORD ) ; 

GET(UFILE) ; 

UDAT A ;  =  WFILE* ; 

GET(UFILE); 

TEHPUORD!=WDATA. SPELLING; 

NOUORD : =NQUORD+  1 
END  < «WHILE« ) ; 

UNUHBs*NOHORD-l; 

IF  (SPELLOTEMPUQKD)  THEN 
BEGIN  UIF«) 

URITELN; 

URITELN  ('NONE  OF  THE  UORDS  IN  THE  VOCABULARY'); 

URITELN  (’AGREE  UITH  THE  SPELLING  GIVEN  IN  THE'); 

WRITE  ('FILE  "' , FILENAME,'".  THE  SPELLING  IN  ', FILENAME,'  IS') 
URITELN  ('  ”' , SPELL,”'  ' >; 

URITELN  ('DO  YOU  WANT  TO  CHANGE  THE  FILENAME,'”  SPELLING?’) 
URITELN  CENTER  "YES"  TO  CHANGE,  ELSE  ENTER  "NO"'); 

WRITE  ('ANSWER  »  '); 

READLN(ANSUER) ; 
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IF  ANSUERO'YES'  THEN 
BEGIN  (*IF«) 

FLAG:=TRUE; 

ENDFLAG:=TRUE; 

END  <*IF«) 

ELSE 

BEGIN  (#ELSE») 

WRITELN; 

URITELN  ('ENTER  THE  CORRECT  SPELLING  OF  WORD.') 
WRITE  ('WORD  =  '); 

READLN  (SPELL); 

WRITELN; 

FOR  1 ss  (LENGTH(SPELL) )  TO  (SPELLENGTH-1 )  DO 
BEGIN  (*FGR«) 

SPELL: 1 CONC AT (SPELL f '  '); 

END  («FOR*); 

END  <«ELSE<); 

END  (*IF«) 

ELSE 

BEGIN  UELSE*) 

FLAG:-TRUE; 

END («ELSE*> ; 

UNTIL  FLAG; 

END  (*WITH») ; 

IF  ENDFLAG-FALSE  THEN 
BEGIN  (*IF»> 

STOREWORD; 

END(*IF»); 

WRITELN; 

WRITELN( 'ENTER  ••YES"  IF  YOU  WOULD  LIKE  TO  ADD'); 

WRITELNC  ANOTHER  SPEECH  FILE;  ELSE  ENTER  •■NO",'); 

WRITEt  *  ANSWER  =  '); 

READLN(ANS); 

WRITELN; 

WRITELN; 

UNTIL  ANSO'YES'; 

END(*ELSE») ; 

END(«PROCEDURE  STORESPEECH* ) ; 


{**#*******#********#*#*#»*********#*****#****##*****»*************** 


PROCEDURE  UORDRECOG  DESCRIPTION 


THIS  PROCEDURE  COMPUTES  A  SCORE  FOR  EACH  WORD  IN  THE  VOCABULARY 
BASED  ON  HOW  WELL  THE  WORD  HATCHES  THE  GIVEN  FILE,  THE  WORD  WITH  THE 
HIGHEST  SCORE  IS  CHOSEN  AS  THE  WORD  THAT  WAS  SPOKEN.  THE  SPEECH  FILE 
IS  LIMITED  TO  ONE  WORD  SINCE  THIS  IS  AN  ISOLATED  WORD  RECOGNITION 
ALGORITHM.  ONCE  THE  RECOGNITION  SCORES  ARE  OBTAINED,  THE  WORD'S  RECOGNITION 
STATISTICS  ARE  UPDATED  (SINCE  THE  ACTUAL  WORD  SPOKEN  IS  STORED  IN  THE 
SPEECH  FILE,  THERE  IS  NO  PROBLEM  WITH  UPDATING  THE  WRONG  STATISTICS). 

THE  FOLLOWING  INFORMATION  IS  STORED  IN  FILE  “RESULTS"  FOR  EACH  SPEECH 
FILE  EXAMINED!  ACTUAL  WORD  SPOKEN,  SPEAKER,  WORD  BEING  SCORED, 

UORDSCORE,  SCORECOUNT,  PUORD,  DELW,  DELCNT ,  INSRTSCORE. 


IK*###*****#*###***#********###*##**#*#***#*#***#*#*#**#**#*##*****##} 


SEGMENT  PROCEDURE  UORDRECOGCWNUFIBilNTEGER) ; 
VAR 

NOWORD, I:INTEGER; 

HIGHSCORE ,NEXTHIGH£ST:REAL; 

HIGHWQRDiSTRINGCSPELLENGTHI; 

FLAG:BOOLEAN; 


BEGIN(»WORDRECOG«) 

WRITELN( ' UORDRECOG'  ) ; 

SEEK(WFILE,WNUf1B) ; 

GET(WFILE); 

WDATA ;  =  WFILEA ; 

WORD :;UD AT A. SPELLING; 

(Output  autoaatic  recognition  results  heading 
to  file  RESULTS.} 

WRITELN(RESULTS,CHR( 12) ) ; 

FOR  l • s 1  TO  5  DO 
BEGIN  < *FOR* ) 

URITELN(RESULTS); 

END(*FOR») ; 

WRITELNtRESULTS,' ACTUAL  WORD  =  ',WQRD); 


WRITELN(RESULTS) ; 

WRITELN< RESULTS ,'  ':1,'W0RD','  ' :15,'W0RDSC0RE ','  ' :4 , ' SCORECOUNT' , 

'  ':4,'PW0RD','  ‘ift/DEl-UV  *  :B,‘DELCHT V  *  :8,‘  INSRTSCGRE* ) 
FOR  1 5  =  1  TO  96  DO 
BEGIN ( *FOR« ) 

WRITE(RESULTS,'-'); 

ENDUFOR*); 

WRITELN(RESULTS) ; 

URITELNC  WORD  BEING  EXAMINED  *  ',WORD); 

SEEK(SFILE,UNUMB); 

GET  ( SF HE ) ; 

WSTATS;--SFILEA; 

IF  UORDFLAG-O  THEN 
BEGIN(*IF») 

{Initialize  aord's  recognition  statistics.) 

WITH  USTATS  DO 
BEGIN  (*UITH«) 

N0C0RRECT:=0; 

NOATTENPT :sO; 

T'OTDIFF:sO.O; 

MINDIFF:*1.0; 

T0TALSC0RE:=O.O; 

SEEK(SFILE; UNUHB) ; 

SFILE*  :  =  WSTATS; 

PUT  (SFILE) 

END  (*WITH«); 

END ( »I F*  > ; 

F I LENAHE:  =  CONC AT (SPEAKER,'  RECOG'  ) $ 

FILETITLE(SPECH, FILENAME); 

RESET! SPECH) ; 

BEGINRECisO; 

RESET(SPECH) ; 

WHILE  NOT  EOF(SPECH)  DO 
BEG1NUWHILE*) 

SEEK (SPECH, BEG INREC) ; 

GET(SPECH) ; 

IOUTt-SPECH*; 

IF  IOUT.SPELL:UORD  THEN 
BEGIN ( » I F*  > 

IF  WORDCNT)=WORDFLAG  THEN 
BEGIN(*IF*) 

WRITELN(RESULTS) ; 

URITELNtRESULTS , ' NEW  SPEECH  FILE:  ','  SPEAKER  »  ' ,I0UT.SPKR> ; 
HIGHSC0RE:=0.0; 

HIGHWORD:* ' XXXXX' ; 

NEXTHIGHEST:sO.O; 

RESET ( UF I LE ) ; 

NOWORD:  =  0; 

WHILE  NOT  EOF(WFILE)  DO 
8EGIN(*WHILE«) 

SEEKtWFILE, NOWORD); 
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CET(WFILE); 

WDATA:3WF1LEA; 

WORD ;=UDAT A. SPELLING; 

{Output  recognition  information  to  screen,  and 
to  file  RESULTS.} 

URITELN; 

URITELNf ' . '); 

WRITELNCUORD  =  1  ,IOUT. SPELL,'  CONTINUATION  «  =  ',UNUI1B>; 

WRITELNT  WORD  COUNT  3  ',U0KDCNT>; 

WRITELN( ‘ WORD  ATTEMPTED  =  ',WORD); 

GETWORDVARS; 

SCOREWORD; 

HR I TELN( RESULTS >WQKD  ,U0RDSC0RE:8  ,SCORECOUNT*:ll , '  *  :7, 

PU0RD:i>:2,'  *  t A , DELW s 6 s 2 , DELCNT i 1 2 , ‘  * :11,INSRTSC0RE:6:2> 
WRITELN(' WORDS CORE  =  '  , WORDSCORE > ; 

WRITELNC . '  >; 

IF  NOUORD-UNUMB  THEN 
BEGIN(*IF») 

SEEX  <  SFI LE ,WNUHB) ; 

GET(SFILE) ; 

MSTATS:sSFILEA ; 

WSTATS.TOTALSCDRE: 3 WST ATS. TOT ALSCORE+ WORDS CORE; 

SEEK  ( SF ILE  ,UNUP1B) ; 

SFILEA:3USTATS; 

PUT (SF ILE); 

END(*IF«); 

{Determine  highest  scoring  word,  and  score  of 
mord  mith  second  highest  score.} 

IF  (WORDSCORE< :HIGHSCORE) AND( WORDSCORE>NEXTHIGHEST)  THEN 
BEGIN(*IF*> 

NEXTHIGHEST:3WORDSCQRE; 

END(*IF«) ; 

IF  ( (UORDSCORE) )HIGHSCORE)  THEN 
BEGIN ( *  IF* ) 

NEXTHIGHEST ;=HIGHSCORE ; 

HIGHSCORE:3WORDSCORE; 

HIGHUORD:3WORD; 

END(*IF*); 

SEEKUFILE, NOWORD); 

GET(UFILE) ; 

GET(WFILE) ; 

NOWORD  i  =  N0VJ0RD  +  L ; 

END  («UHILE«); 

SEEK  WF  ILE  ,UNUHB) ; 

CET(UFILE) ; 

WDATA: 3WFILEA ; 

WORD: 3 WDATA. SPELLING; 

{Update  word's  recognition  statistics.} 
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SEEK  <  SF I LE ,WNUNB>  ^ 

GET ( SF I LE  > ; 

WSTATS  t  =  SFILEA  5 

WSTATS.  NOATTEMPTi  =  WSTATS.NOATTEMPT  +  l? 

IF  H3GHW0RD=WDATA . SPELLING  THEN 
BEGIN<«IF«) 

WSTATS.  NOCORRECT ?= WSTATS, NOCORRECT+1; 

WSTATS. TOTDIFF  t  =  WSTATS .TOTDIFF+ (HIGHSCORE-NEXTHIGHEST) 
IF  (H1GHSCORE-NEXTH1GHEST) ( WSTATS. HINDI FF  THEN 
BEGIN(«IF*> 

WSTATS. MINDIFF  s  =  H1GHSC0RE-NEXTHIGHEST; 

END  (#IF«); 

END ( *  I F* ) ; 

SEEK(SFILE,UNUHB); 

SFILEA:sWSTATS; 

PUT(SFILE) ; 

END(*IF«) 5 
FLAG:sTRUE; 

WHILE  ( (FLAG=TRUE)OR(IQUT«CONTFLAGsl ) )DQ 
BEGIN  <*WHIL£«> 

FLAG  s  =  FALSE ; 

SEEX(SPECH,BEGINREC) ; 

GET(SPECH) ? 

IOUT:sSPECHA ; 

BEGINREC:=BEGINREC+1 ; 

END  (*WHILE«>; 

GET(SPECH) ; 

WORDCNT :  =  WORDCNT  +  1 ; 

END(»IF») 

ELSE 

BEGINOELSEO 

BEGINREC:=BEGINREC+1; 

GET < S  PECH ) ; 

ENDUELSE*); 

END(*WHILE*> ; 

CLOSE(SPECH) ; 

END  ^PROCEDURE  WORDRECOG*); 


PROCEDURE  FUZZYOPT  DESCRIPTION 


THIS  PROCEDURE  ATTEMPTS'  TO  OPTIMIZE  THE  SPECIFIED  WORD 
FUZZY  VARIABLES.  THE  OPTIMIZATION  IS  ACCOMPLISHED  BY  FIRST  ALTERING 


A  FUZZY  VARIABLE  HITHIN  ITS  SPECIFIED  LIMITS , ' COttPUT INC  THE 
RECOGNITION  RESULTS,  AND  THEN  ADJUSTING  THE  VARIABLE'S  VALUE  AND 
LIMITS  BASED  ON  THESE  RESULTS, 


#»*****####**#**#**##*#*#«**####**##*#**#*»##*****#*#*######*#######*} 


SEGMENT  PROCEDURE  FUZZ Y OPT (NQPT IONS {INTEGER) ; 
VAR 

QUIT:BOOLEAN; 

NOWORD, I, FINT  sINTEGER; 

SAVE VAR ,SAVESCORE:REAL; 

RECSCR1 ,RECSCR2:REAL; 

MINSCORE {REAL ; 

OPTANSWRsARRAYtl, .NOFUZZYVARS]  OF  STR1NGI33; 


{**««*«« *«*«««««« ««*«««« »««««*»»«« *«««*« *»««««««***«««»«*««*« ««*««* 


PROCEDURE  GETOPTVARS  DESCRIPTION 


THIS  PROCEDURE  INITIALIZES  THE  VALUES  OF  THE  GLOBAL  FUZZY 
VARIABLES  TO  CORRESPOND  TO  A  WORD'S  OPTIMUM  FUZZY  VARIABLES, 
WHICH  ARE  STORED  IN  FILE  FZOPT.  THE  VALUE  OF  NOWORD  INDICATES 
THE  APPROPRIATE  WORD, 


««*««»» ***«**«« *##*##  IK*###*####*###****#*##**#*##} 


PROCEDURE  GETOPTVARS (NOWORD {INTEGER) ; 
BEGINUGETOPTVARS*) 


SEEX(FZOPT,  NOWORD); 
CET(FZOPT); 
FLIMITssFZOPT*  ? 
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WITH  FL1HIT  DO 
BEGIN ( *WITH*> 

STHRs  =  FU2VARC 1 1 ; 
SUBE:=FUZVAR[23 ; 
SUBF:sFUZVARI3] ; 
INSE  s  =  FUZVARC43 ; 
INSF : =FUZVARC53 ; 
BELE  t  =  FUZVARC63 ; 
DELF:  =  FUZVARC73 ; 
DELGs=FUZVARC83 ; 
DCNE:=FUZVARC9] ; 
DCNF^FUZVARCIOI; 
DCN&:=FUZVARC113; 
SFE:=FUZVAR[121; 
SFFs  =  FUZVARC 133 ; 
CHVE;=FUZVARtl43; 
CHVFs=FUZVARC153 ; 
STATEi  =  FUZVARCU]; 
STATFs=FUZVARC173 ; 
STATG:-FUZVARC18]; 
THRlE:sFUZVAR[193; 
THR1F  s  =  FUZVAR[203 ; 
THR2E:=FUZVARC21]; 
THR2Fs*FUZVARt22]; 
END(*WITH») ; 

tor 


END( ^PROCEDURE  GETOPTVARS*) ; 


I#######***#*#**###*##*##*** ###*###»##*#*######**#**«**##«##»*##* 


PROCEDURE  RECOGSCR  DESCRIPTION 


THIS  PROCEDURE  COMPUTES  A  FIGURE  OF  MERIT  USED  IN  DETERMINING 
WHETHER  OR  NOT  A  CHANGE  IN  A  FUZZY  VARIABLE  CORRESPONDED  TO  AN 
INCREASE  IN  THE  RECOGNITION  ACCURACY,  THIS  FIGURE  IS  CALCULATED 
USING  THE  RECOGNITION  RESULTS  STORED  IN  FILE  SFILE. 


#**«###*«*#**#**#***#**#*#*#**#**#*#*#**»#*»****###*#**#»#»#*#»***»} 
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PROCEDURE  RECOGSCRt VAR  RECSCR  i REAL ;  WNUP1B :  INTEGER ) ; 


CONST 

FV1=0*6; 
FV2  =  0 .25 ; 
FV3=0.05; 
FV4=0. 10; 


BEGIN ( *RECOGSCR* ) 

WRITELNI ' RECQGSCR' ) ; 

SEEK<SFILE,WNUHB); 

GET  < SF I LE ) ; 

WSTATS:sSFILE* ; 

WITH  USTATS  DO 
BEGIN ( *WITHK  > 

IF  (NOCORRECTOO)  THEN 
BEGIN(»IF*) 

RECSCR :  =  FV1«<N0C0RRECT/N0ATTEHPT )  +  FV2»  <  TOTDI FF/NOCORRECT )  + 
FV3# ( HI ND I FF)+FV4M TOT ALSCQRE/NQ ATTEMPT) ; 

END ( *  I F  * ) 

ELSE 

BEG1NUELSE*) 

RECSCR : s  0 ; 

END(*ELSE*) ;  . 

END(«UITH*) ; 


ENDUPROCEDURE  RECOGSCR*); 


<»»»*««»»»»»»*»»»««»»««*»»»»»»»»*»«»*»»»»»«»»«»****»* »*x« «**»*«««»» 


PROCEDURE  GETLIMITS  DESCRIPTION 


THIS  PROCEDURE  RETRIEVES  THE  INITIAL  FUZZY  VARIABLE  VALUES, 

n ax i nun  limits,  and  minimum  linits  which  are  used  to  initialize 

A  HEW  WORD'S  OPTIHUH  FUZZY  VARIABLES. 


HIHHIHHOHHmoiHOIOOHMIKimiimHtiOHmHHHHl} 


PROCEDURE  GETLIHITS; 

VAR 

I : INTEGER; 

BEGIN(*GETLIHITS») 

RESET(OPTFUZZY) ; 

WITH  FLIHIT  DO 
BEGIN(«WITH») 

FOR  I :=1  TO  NOFUZZYVARS  DO 
BEGIN ( *FOR* ) 

READLN  (OPTFUZZY  ,FUZVARCI  3  ,nAXLIHITC  I  D,HINLIHITi:iI#FUZNAMECI3) 
END ( «FOR * ) ; 

RECSCORE:sO; 

END ( *UITH») ; 

CLOSE(OPTFUZZY); 

END( •GETLIMITS*) ; 


{»*«*####**#*#***#*#***#»#**»***#***#**»*#**»#***»#***#*##«*#****## 


PROCEDURE  SETLIHITS  DESCRIPTION 


THIS  PROCEDURE  INITIALIZES  THE  OPTIHUR  FUZZY  VARIABLES  FOR 
ALL  WORD'S  WHOSE  VARIABLES  WERE  NOT  PREVIOUSLY  INITIALIZED. 


**»«** t*#*#«*t ####»***«*»******#***«*#***«**#****«»*********«« #*«**} 


PROCEDURE  SETLIHITS; 

VAR 

I #WORDCNT  s INTEGER; 

BEGINt  KSETLIMITS* ) 

RESET (WFILE) ; 

NOWORD:=0; 

RESET ( FZOPT ) ; 

WHILE  NOT  EOF(FZOPT)  DO 
BEGINUWHILE*) 

SEEK<  FZOPT , NOWORD); 
GET  <  FZOPT ) 5 
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GET  <  FZOPT ) ; 

NOUORD :  =  N0UORD+1 ; 

END(*UHILE*>‘, 

NOUORD :  =  NOWORD-l ; 

U0RDCNT:=0; 

RESET(UFILE) ; 

WHILE  NOT  EOF(UFILE)  DO 
BEGIN! *UHILE*  > 

SEEX(WFILE, UORDCNT); 

GET(UFILE) ; 

GET(UFILE); 

UORDCNT :=HORDCNT+l ; 

END ( *WHILE* ) ; 

UORDCNT :  =  UORDCNT- 1 ; 

IF  UORDCNT)NOUORD  THEN 
BEGIN(*IF*) 

IF  NOUORD>-l  THEN 
BEGIN! *IF* ) 

SEEK (FZOPT ; (NOUORD) ) ; 
GET(FZOPT) ; 

END(#IF«); 

FOR  I :- (NGUORD+1 )  TO  UORDCNT  DO 
BEGIN(»FOR«) 

GETLIHITS; 

FZOPT* :=FLIMIT; 

PUT(FZOPT) ; 

SEE1UFZOPT  >1) ; 

GET(FZOPT); 

END ( *FOR* ) ; 

END ( * I F* ) 5 

ENDUSETLIHITS*) ; 


{»*»##»*#»#»#***#**#*»***#**«**#**#*#**»****»#***«**#****##»****#* 


PROCEDURE  OPTRECOG  DESCRIPTION 


THIS  PROCEDURE  IS  IDENTICAL  TO  PROCEDURE  UORDRECOG,  EXCEPT  THAT 
NO  RESULTS  ARE  STORED, 


****«»} 


PROCEDURE  OPTRECOG(WNUHB: INTEGER) ; 


NOUORD  f I : INTEGER; 

HIGHSCORE  >NEXTHIGHEST:REAL; 
HIGHWORDjSTRINGCSPELLENGTH]  ; 
FLAG:BQOLEAN; 


BEGIN ( *OPTRECOG* ) 

WRITELNCOPTRECOG' ); 

SEEK(WF1LE,WNUMB>; 

GET(WFILE) ; 

WDATA:=WFILE* ; 

WORD :-WD AT A. SPELLING; 

URI TELN ( ' WORD  BEING  EXAMINED  =  ',UORD); 

SEEK ( SF I LE jWNUMB) ; 

GET(SFILE) ; 

WSTATS:sSFILE* ; 

Unitialize  nord's  recognition  statistics.) 

WITH  WSTATS  DO 
BEGIN  («WITH«) 

NOCORRECT: =0; 

N0ATTEHPT:=0; 

T0TDIFF:*-0.0; 

HINDIFFjU.O; 

T0TALSC0RE:=0.0; 

SEEK ( SFILE ,UNUNB  > ; 

SF1LE‘:*WSTATS; 

PUT  (SFILE) 

END  (*WITH«); 

F1LENAHE:  =  C0NCAT( SPEAKER , ' RECOG' ) ; 
FILETITLE(SPECH,FlLENAPiE) ; 

RESET(SPECH); 

BEGINRECj=0; 

RESET(SPECH) ; 

WHILE  NOT  EOF(SPECH)  DO 
BEGIN(*WH1LE») 

SEEK(SPECH,BEG1NREC); 

GET(SPECH) ; 

IOUT:sSPECH* ; 

IF  10UT «SPELL  =  UORD  THEN 
BEGINUIF*) 

HIGHSCORE : =0 . 0; 

HIGHWORDi= ’ XXXXX*  ; 

NEXTHIGHEST ;=0.0; 

RESET ( WF ILE  > ; 
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tF 


N0W0RD:=0; 

WHILE  HOT  EOF(WFILE)  DO 
BEGIN ( «UHILE* ) 

SEEKWFILE  ,NOUORD) ; 

GET ( UF I LE ) ; 

WDATA ;SWFILE* ; 

WORD  :  =  WDATA  SPELLING; 

WRITELN; 

URI TELN  ( ' . ') 

URITELN  < '  WORD  =  '  ,IOUT.  SPELL,'  CONTINUATION  *  -  ‘  ,WNUHB); 

URITELN< 'WORD  ATTEMPTED  =  ',WORD>; 

GETOPTVARS (NOWORD) ; 

SCOREWORD; 

WRITELNl ' UORDSCORE  =  ' ,WORDSCORE> ; 

WRITELNC . 1 ) 

IF  NOWORD=WNUHB  THEN 
BEG IN ( » IF* ) 

SEEK(SFILE,WNUHB); 

GET(SFILE) ; 

USTATSs-SFILE* ; 

WSTATS.TOTALSCORE:=WSTATS.TOTALSCORE+WORDSCORE; 

SEEK ( SF ILE , WHUHB ) ; 

SFILE* :*WSTATS; 

PUT(SFILE); 

END < *IF« ) ; 

(Determine  highest  scoring  word,  and  score  of  the 
second  highest  aord  score •> 


N 


n 


IF  (WORDSCORE< :HIGHSCORE)AND(WORDSCORE)NEXTHIGHEST )  THEN 
BEGINUIF*) 

NEXTHIGHEST : =W0RDSC0RE ; 

ENDUIF*); 

IF  ( (WORDSCORE) )HIGHSCORE)  THEN 
BEGIN(*IF*> 

NEXTHIGHEST: 5  HI GHSCORE; 

H1GHSCORE: UORDSCORE; 

HIGHWORD;=WORD; 

END(*IF«>; 

SEEK (WFILE,  NOWORD); 

GET(WFILE); 

GET(WFILE) ; 

NOWORD:  =  NOWORD+1 ; 

END  ( *WH1LE* ) ; 

FLAG :  =  TRUE ; 

WHILE  ( (FLAG* TRUE) OR <IOUT«CONTFLAG:l) )DO 
BEGIN  («WHILE«) 

FLAG :  =  FALSE ; 

SEEK(SPECH,BEGINREC) ; 

GET(SPECH) ; 

IOUT:sSPECH* ; 

BEG1NREC:  =  BEGINREC  +  1 ; 

END  («UHILE»); 

GET  <  SPECH ) ; 
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SEEK(UFILE,UNUHB>; 

CET(UFILE) ; 

WDATA:*WF1LEA; 

WORD :=WD AT A. SPELLING; 

{Update  nerd's  recognition  statistics.) 

SEEK ( SF1 LE  ,WNUNB) ; 

GET(SFILE); 

USTATS:=SFILEA ; 

WSTATS . NOATTEHPT  s=WSTATS .N0ATTEHPT  +  1 ; 

IF  HIGHWORD*WDATA. SPELLING  THEN 
BEGIN(#IF») 

WSTATS. NOCORRECT:*USTATS.NOCORRECT+1; 

WST ATS. TOTDI FF  t  =  WSTATS .TOTDIFF+ (HIGHSCORE-NEXTHIGHEST) 
IF  ( HI GHSCORE-NEXTHI CHEST )< WSTATS. HIND  IFF  THEN 
BEG1N(*IF*) 

WSTATS. HIND  IFF;* HI GHSCORE-NEXTHI CHEST; 

END  (*IF«>; 

ENDUIF#); 

SEEXtSFILE ;UNUHB) ; 

SFILEA;*WSTATS; 

PUT(SFILE); 

END ( » I F* ) 

ELSE 

BEGIN(*ELSE») 

BEGINREC:*BEGINREC+1; 

GET(SPECH) ; 

END ( *ELSE » ) ; 

ENDUWHILE*); 

CLOSE ( SPECH ) ; 

END  (*PROCEDURE  OPTRECOC»); 


PROCEDURE  AUTORECOG(VAR  RECSCR-.REAL) ; 
BEGIN 

SEEK ( FZOPT  ,WNUHB) ; 

FZOPTA:*FLIHIT; 

PUT(FZOPT) ; 

OPTRECOG(WNUHB) ; 
RECOGSCR(RECSCR,WNUHB); 

SEEX(FZOPT /WNUHB) ; 

GET(FZOPT) ; 

FLIHIT  s  =  FZOPTA  ? 

ENDUPROCEDURE  AUTORECOG*) ; 


BEGIN (*FUZZYOPT*) 


IF  NOPT I ONS  = 1  THEN 
BEGINUIF*) 

GETLIMITS; 

{Get  names  of  fuzzy  variables  to  be  optimized.) 
WRITELN; 

WRITELN ( ' ENTER  "YES"  NEXT  TO  THE  FUZZY  VARIABLE') 
WRITELN! ‘ NAMES  TO  BE  OPTIMIZED. ') ; 

FOR  1 8*1  TO  NOFUZZYVARS  DO 
BEGIN ( *FOR* ) 

HRITE(FLINIT.FUZNAHEtn,'?  :  '); 
READLN<OPTANSWRtI)>; 

END(»FOR*); 

WRITELN; 

END(*IF») 

ELSE 

BEGIN(»ELSE») 

FOR  1 5*1  TO  NOFUZZYVARS  DO 
BEGINUFOR*) 

OPTANSWRII) :  = *  YES' ; 

END(*FOR* ) ; 

ENDUELSE#)  ; 

SETLIHITS; 

RESET(FZOPT) ; 

RESET(SFILE) ; 

RESET(WFILE); 

WNUMB:=0; 

WRITELN; 

SEEX(WFILE  jWNUHB) ; 

GET(WFILE) ; 

{Compute  initial  recognition  statistics  for 
each  mord  being  examined. > 

WHILE  NOT  EOF(UFILE)  DO 
BEGINUWHILE*) 

IF  OPERATEtWNUMB  +  13* ' YES'  THEN 
BEG I N ( *  I F  * ) 

OPTRECOG(WNUMB) ; 

END ( *  I F* ) ; 

SEEK ( WF I LE  ,WNUMB) ; 

GET(WFILE); 

GET(WFILE) ; 

WNUMB:*WNUHB+1; 

END(»WHILE*) ; 

QUIT5*FALSE; 

REPEAT 

FOR  PINT: *1  TO  NOFUZZYVARS  DO 
BEGINUFOR*) 


IF  0PTANSWREFINT3  =  YES'  THEN 
BEGINUIF*) 

UNUHB::0; 

RESET (WPILE) ; 

WHILE  NOT  EOF(UFILE)  DO 
BEGIN(»WHILE*> 

IF  OPERATEC WNUHB+1 ]  = ' YES  *  THEN 
BEGIN ( * IF»  > 

{Coapute  initial  Figure  of  Merit  for  word.) 

RECOGSCR (SAVE SCORE ,UNUPIB) $ 

RECSCR1 :=SAVESCORE; 

GETOPTVARS(WNUnB); 

{Change  next  fuzzy  variable  to  be  evaluated, 

and  adjust  its  value  and  I  i at i t s  based  on  the  word's 

recognition  statistics  using  the  variables  new  value.} 

WITH  FLIHIT  DO 
BEG1NUWITH*) 

SAVEVAR:  =  FUZVAR[FINTI ; 

FUZVARCFINT3 := ( FUZVARCFINT3+HINLIHITCFINT3 )/2; 
AUT0REC0G(RECSCR2) ; 

IF  RECSCR2)RECSCR1  THEN 
•  BEG IN(*IF*) 

NAXLIflITCFINT3i=SAVEVAR; 

RECSCR1 : =RECSCR2 ; 

END ( *  I F* ) 

ELSE 

BEGINUELSE*) 

HINUMITCFINT}:=FUZVARLFINT]; 

IF  SAVEVAR<  ( HAXL1  III TEF INTI -0.5)  THEN 
BEGINUIF*) 

FUZVAR[FINT3:sSAVEVAR*0«5; 

END(»IF»> 

ELSE 

BEGIN (*ELSE«) 

FUZVARIFINT3 ;= ( SAVEVAR+RAXLINITCFINT3 )/2; 

END ( *ELSE« ) ; 

AUT0REC0G(RECSCR2) ; 

IF  (RECSCR2>RECSCR1)  THEN 
BEGIN(*1F») 

HINLIMITCFINT3;=SAVEVAR; 

RECSCR1:=RECSCR2; 

END  t  *  I F* ) 

ELSE 

BEGINUELSE*) 

HAXLIH1TCFINT] ?  *FUZMAREFINT] ; 

FUZVARCFINT3 : =SAVEVAR; 

END(*ELSE»); 

END ( «ELSE w ) ; 

RECSCORE  s-RECSCRl ; 

SEEK ( FZOPT |UNUf1B) ; 
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FZOPT* :sFLIHIT; 
PUT(FZOPT); 
ENDUWITH*); 

END ( * I F» ) ; 

SEEK ( UFI LE  ,WNUHB) ; 

GET  < WFILE) ; 

GET(WFILE) ; 

WNUP1B  s  =  WNUNB  + 1 ; 
END(*WHILE») ; 

END(*IF*)$ 

ENDUFOR*); 

UNTIL  (QUIT=TRUE); 

ENDUPROCEDURE  FUZZYOPT#); 


{«****#*****«***«******«*»«***************************************** 


PROCEDURE  AUTOHAT  DESCRIPTION 


THIS  PROCEDURE  PROVIDES  THE  FOLLOWING  OPTIONS: 

0.  EXECUTE  UP  TO  "HAXNOPTIONS"  OF  THE  FOLLOWING  OPTIONS. 

(OPTIONS  2,  4,  5,  AND  6  DEFAULT  TO  OPERATE  ON  ALL  WORDS, 

AND  OPTION  6  DEFAULTS  TO  OPERATE  ON  ALL  VARIABLES  WHEN  THIS 
OPTION  IS  USED.  THE  REGAINING  OPTIONS  HAVE  NO  DEFAULT 
VALUES,  AND  REQUIRE  USER  INPUT  BEFORE  THEY  CAN  BE  EXECUTED.) 

1.  STORE  A  NUHBER  OF  SPEECH  FILES  FOR  FUTURE  USE. 

2.  COHPUTE  THE  WORD  RECOGNITON  RESULTS  FOR  A  SET  OF  SPECIFIED 
WORDS,  USING  THE  SPEECH  FILES  STORED  IN  FILE  RECOG, 

3.  COHPUTE  A  GIVEN  WORD’S  RECOGNITION  RESULTS  USING  THE  SPEECH 
FILES  STORED  IN  FILE  RECOG. 

4.  INITIALIZE  THE  WORD  STATISTICS  FOR  A  SET  OF  SPECIFIED 
WORDS,  USING  THE  SPEECH  FILES  STORED  IN  FILE  SPECH, 

5.  DETERHINE  THE  PHONEHE  REPRESENTATIONS  FOR  A  SET  OF  SPECIFIED 
WORDS,  USINC  THE  SPEECH  FILES  STORED  IN  FILE  SPECH, 

6.  OPTIHIZE  THE  SPECIFIED  FUZZY  VARIABLES  FOR  A  SET  OF 
SPECIFIED  WORDS,  USING  THE  SPEECH  FILES  STORED  IN  FILE  RECOG. 

7.  CHANGE  THE  OPTIMUM  FUZZY  VARIABLES  FOR  A  GIVEN  WORD,  OR 
CHANGE  THE  INITIAL  OPTIHUH  FUZZY  VARIABLES  STORED  IN 
FILE  OPTFUZZY. 
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PROCEDURE  CONSTANTS 


CONSTANTS 


HAXNOPTIONS 


DESCRIPTION 


THE  HAXlHUfl  NUMBER  OF  OPTIONS  THAT  CAN  BE 
EXECUTED  CONSECUTIVELY. 


***#»***#********#*##**#***#*****************»*####»######**##**#»##} 


SEGMENT  PROCEDURE  AUTOHAT; 
CONST 

HAXNOPTIONSM; 


1 >NOPT IONS: INTEGER; 

AOPTIONiARRAYCl.. HAXNOPTIONS]  OF  INTEGER; 
NOWORDsINTEGER; 

STRRECORD: INTEGER; 


{»*#****#*»******#***»»**********»*#*#**#»»*#»»*#**#***»#»*#»#»##*#** 


PROCEDURE  GETOPWORDS  DESCRIPTION 


THIS  PROCEDURE  ALLOWS  THE  USER  TO  SPECIFY  THE  WORDS  THAT  ARE 
TO  BE  OPERATED  ON. 


»***#***»**»**»*******##»»»*»*##»*#* ***#####»**##»**#*»*#«# *#»#*#*#»###*} 


PROCEDURE  GETOPWORDS; 
VAR 

1 ^UORDCNT : INTEGER ; 
ANSWER sSTRING[3] ; 
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BEG1N(*GET0PW0RDS«) 


U0RDCNT:=1; 

ANSWER:5 ' YES' ; 

IF  NOPTI0NS5 1  THEN 
BEGIN(*IF») 

WRITELN ( ' ENTER  "YES"  NEXT  TO  THE  WORDS  TO'); 
WRITELN! 'BE  OPERATED  ON.'); 

WRITELN; 

UR I T E < ' OPERATE  ON  ALL  WORDS?  :  '); 

READLNt ANSWER) ; 

IF  ANSWERO'YES'  THEN 
BEGIN ( *IF«) 

W0RDCNT:50; 

RESET ( UF HE ) : 

WHILE  NOT  EOF(UFILE)  DO 
BEGINUWHILE*) 

SEEK ( WF I LE ,W0RDCNT ) ; 

GET(WFILE) ; 

WDATA:5WFILE* ; 

WRITEtWDATA. SPELLING,'?  :  '); 

READLN! OPERATE IW0RDCNT+ II ) ; 

GET(WFILE) ; 

WORDCNT :  =  UQRDCNT+1 ; 

END(«WHILE») ; 

WORDCNT:=WORDCNT+1; 

END C  «IF* ) ; 

END(»IF») ; 

FOR  I :5WORDCNT  TO  HAXNOWORDS  DO 
BECIN(«FOR«) 

OPERATECI3:=ANSWER; 

END(«FOR«); 

END! ‘PROCEDURE  GETOPWORDS*) ; 


{IHHIHIOHiHHHHHHiHHHOMHHIIIIIIIIHiHHimitmmOl 


PROCEDURE  INITALL  DESCRIPTION 


THIS  PROCEDURE  INITIALIZES  THE  WORD  STATISTICS  FOR  A  SET  OF 
SPECIFIED  WORDS,  USING  THE  SPEECH  FILES  STORED  IN  FILE  SPECH. 

IF  OPTION  0  IS  USED,  THE  WORD  STATISTICS  FOR  ALL  THE  WORDS 
IN  THE  VOCABULARY  WILL  BE  INITIALIZED. 


PROCEDURE  I NI TALL ; 

VAR 

J,INT:INTEGER; 

BEGIN ( *INITALL« ) 

URITELN ( ' INI  TALL ' ) ; 

IF  NOPTION3= 1  THEN 
INITFILE; 

GETOPWORDS; 

GETFUZZYVARS; 

RESET(SFILE); 

KESET(WFILE) ; 

N0W0RD:=0; 

WHILE  NOT  EOF ( WFILE)  DO 
BEGINUUHILE*) 

IF  OPERATE! NO WORD+l]s'YES'  THEN 
BEG I N ( *  I F  * ) 

INITST AT ( NOWORD ) ; 

END ( *IF* ) ; 

SEEK (WFILE jNOUORD ) ; 

GET(WFILE); 

GET(WFILE); 

NOWORD ;  =  NOWORD  +  1 ; 

END <  * WH I LE* ) ; 

END( ^PROCEDURE  I NIT  ALL*  > ; 


!«»»**#««*»»»»»»«»**»*»*»»»««»»»»»»»»»»**»»»»»»»*»***«**««««»**«**»» 


PROCEDURE  RECOGWORD  DESCRIPTION 


THIS  PROCEDURE  COMPUTES  A  GIVEN  WORD’S  RECOGNITION  RESULTS 
USING  THE  SPEECH  FILES  STORED  IN  FILE  RECDG. 


***#*#«*##**#»#*###*#*#**##**#**#*#*####*##***#»##*#**#«##**«»###»*#«} 


PROCEDURE  RECOGWORD; 


I : INTEGER ; 

TEHPWORD: STRING! SPELLENGTHI ; 


BEGIN  <  *RECOGUGRD» ) 

INITFILE; 

WRITELN; 

WRITELN  ('INPUT  EXACT  SPELLING  OF  WORD  TO  BE  RECOGNIZED  '); 
WRITE  ('WORD  =  '); 

READLN  (WORD); 

FOR  I := (LENGTH(WORD) )  TO  (SPELLENGTH-1 )  DO 
BEGIN ( *FOR* ) 

WORD :  =  CONC AT ( WORD , '  '); 

END( *FOR*) ; 

WRITELN  ; 

RESET(SFILE) ; 

RESET  (WFILE) ; 

TEHPWORD  :s ‘  '; 

WNUHB:=0; 

WORDFLAG:-0; 

WORDCNT:sO; 

WHILE  (NOT  EOF(WFILE) ) AND (WORD <  > TEHPWORD)  DO 
BEGIN  UWHILE*) 

SEEK (  W;'ILE ,WNUHB) ; 

GtT(WFILE) ; 

WDATA:=WFILEA ; 

GET(WFILE) ; 

TEHPWORD :=WD AT A. SPELLING; 

WNUHB:=WNURB+  1 
END  <«WHILE«); 

.  WNUHB:=WNUHB*1 ; 

IF  (WORDOTEHPUORD)  THEN 
BEGIN  <*IF») 

WRITELN; 

WRITELN  (‘WORD  DOES  NOT  EXIST.  *); 

END  (*IF«) 

ELSE 

BEGIN  (»ELSE*) 

REURITE(RESULTS); 

WORDRECOG(WNUHB) ; 

CLOSE(RESULTS); 

ENDUELSE*) ; 

WRITELN! 'THE  RECOGNITION  RESULTS  ARE  STORED  IN  "RESULTS".'); 
EHD(«PROCEDURE  RECOGWORD*); 
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{M»o»*«»»»***»tt«*»**»**»*»**»»»»*****»x»****x  »«***»*«*******»***»» 


PROCEDURE  RECOGALL  DESCRIPTION 


THIS  PROCEDURE  COMPUTES  THE  RECOGNITION  RESULTS  FOR  A  SET  OF 
SPECIFIED  WORDS,  USING  THE  SPEECH  FILES  STORED  IN  FILE  RECOG, 

IF  OPTION  0  IS  NOT  USED,  THIS  PROCEDURE  CAN  BE  RESTARTED 
AT  A  GIVEN  CONTINUATION  NUMBER. 


xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx} 


PROCEDURE  RECOGALL; 

BEG1NURECOGALL*) 

WORDFLAG :  =  0 ; 

WORDCNT :=0; 

IF  NOPTIONS= 1  THEN 
BEG I N  <  *  I F  * ) 

INITFILE; 

URITELN ( ' IF  YOU  ARE  CONTINUING  A  PREVIOUS  RECOGNITION  RUN;'); 
WRITELN; 

WRITELN( 'ENTER  THE  "CONTINUATION  NUMBER” ;  ELSE  '); 

WRITELN! '  ENTER  "O".'  ); 

WRITE ( ' CONTINUATION  #  =  '); 

READLN(UNUMB) ; 

WRITELN; 

WRITELN! ' ENTER  THE  "WORD  COUNT”;  ELSE  ENTER  “O”.'); 

WRITE! ' WORD  COUNT  -  '); 

READLN(WORDFLAG); 

WRITELN; 

END(*IF*> 

ELSE 

BEGINUELSE*) 

WNUMB;=0; 

END ( *  ELSE*  ) ; 

GETOPWORDS; 

RESET(WFILE); 

RESET(SFILE); 

IF  (W0RDFLAG:0)  AND  (WNUHB’O)  THEN 
BEGINUIF*) 

REWRITE(RESULTS); 

END ( *  I F* ) 

ELSE 

BEGINUELSE*) 

APPEND(RESULTS) ; 

ENBUELSE*) ; 

SEEKIWFILE,WNUMB) ; 
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GET(I)FILE) ; 

WHILE  NOT  EOF(UFILE)  DO 
BEGINUWHILE*) 

IF  OPERATECWNUMB+II1 ' YES'  THEN 
BEGIN(*IF«) 

WORDRECOG(UNUHB) ; 

END ( *  I F  * ) ; 

SEEK( WFILE ,UNUMB) ; 

GET(WFILE)  ; 

GET(WFILE) ; 

W0RDFLAG:=O; 

WORDCNT:sO; 

WNUNB:£WNUMB+1 ; 

END(*WHILE«); 

CLOSE ( RESULTS )  ; 

WRITELNt ' THE  RECOGNITION  RESULTS  ARE  STORED  IN  "RESULTS1*, ' ) ; 
ENDUPROCEDURE  RECOGALL*); 


I*#**#***#*********####****#*#**#*###****#*###*####**##***##*#**#### 


PROCEDURE  DETPHALL  DESCRIPTION 


THIS  PROCEDURE  DETERMINES  THE  PHONEME  REPRESENTATIONS  FOR  A  SET 
OF  SPECIFIED  WORDS,  USING  THE  SPEECH  FILES  STORED  IN  FILE  SPECH, 

IF  OPTION  0  IS  NOT  USED,  THE  USER  MUST  ENTER  THE  WORDS  TO 
BE  OPERATED  ON,  THE  NUMBER  OF  VECTORS  PER  WORD  PHONEME,  AND  THE  MAXIMUM 
NUMBER  OF  ITERATIONS  (NOITERATIONS)  TO  BE  PERFORMED,  ALSO,  THIS 
PROCEDURE  CAN  BE  RESTARTED  AT  A  GIVEN  CONTINUATION  NUMBER,  HOWEVER, 

IF  OPTION  0  IS  USED,  THE  PHONEME  REPRESENTATION  OF  ALL  THE  WORDS  IS 
DETERMINED,  THE  NUMBER  OF  PHONEME  VECTORS  PER  WORD  (NOPHVECTORS)  DEFAULTS 
TO  THE  VALUE  OF  “NOTOAVG" ,  AND  THE  VALUE  OF  NOITERATIONS  DEFAULTS 
TO  2. 


*»***###**#*##*»*#*###»#******##**##«*#*»**#******»»****##*#*****#**«#*} 


PROCEDURE  DETPHALL; 

VAR 

NOPHVECTORS : INTEGER ; 
NOITERATIONS, ICOUNT  s INTEGER; 


BEGIN!*DETPHALL») 


IF  N0PTI0NS=1  THEN 
BEGINUIF*) 

INITFILE; 

WRITELNt'lF  YOU  ARE  CONTINUING  A  PREVIOUS  RECOGNITION'); 
URITELN ( ’ RUN  ENTER  THE  "CONTINUATION  NUMBER" ;  ELSE  '); 
WRITELN ( ' ENTER  “O".'); 

WRITE! ‘ CONTINUATION  #  *  '); 

READLN(NOUORD); 

URITELN; 

URITELN; 

URITELN! 'ENTER  THE  NUMBER  OF  VECTORS  PER  UQRD  PHONEME.'); 
URITEC#  VECTORS  =  '); 

READLN(NOPHVECTORS); 

URITELN; 

URITELN! ' ENTER  THE  NUMBER  OF  ITERATIONS  TO  BE  PERFORMED.') 
URITECNO.  ITERATIONS  =  '); 

READLN(NOITERATIONS); 

URITELN; 

END ( *1 F* ) 

ELSE 

BEGINUELSE*) 

N0U0RD:-0; 

IF  NOTOAVG  =  1  THEN 
BEGIN!*IF*) 

NOPHVECTORS;--l; 

END ( * I F  * ) 

ELSE 

BEGIN! *ELSE«) 

NOPHVECTORSj* NOTOAVG; 

END ( »ELSE* ) ; 

N0ITERATI0NS:=2; 

END(»ELSE»> ; 

GETOPUORDS; 

RESET(SFILE) ; 

RESET(WFILE); 

FIlETITLE!STATSOUT ,' PHREPDAT' ) ; 

REURITE(STATSOUT) ; 

GETFUZZYVARS; 

SEEK!WFlLE,NOUORD); 

GET(UFILE) ; 

WHILE  NOT  EOF(UFILE)  DO 
BEGINUWHILE#) 

IF  OPERATECNOWORD+II* ' YES'  THEN 
BEGIN! « IF* ) 

URITELN! 'DETPHALL  CONTINUATION  NUMBER1  ' ,NOUORD); 
URITELN; 

OPERATION:* 'AUTODET' ; 

AUTODET ( NOWORD, NOPHVECTORS,CHNGEFLAG>; 

lNITSTAT(NOWORD) ; 

lCOUNT:sO; 

REPEAT 

ICOUNT:*ICOUNT+l: 


CHNGEFLAG  s-FALSE; 

OPERATIONS1 ' CORRECTR"  ; 

AUTODETtNOWORD , NQPH VECTORS ,CHNGEFLAG) ; 
INITSTAT(NQUORD) ; 

OPERATION:5' ADJUSTDE'; 

AUTODETtNOWORD ,NOPH VECTORS, CHNGEFL AG) ; 
INITSTAT  tNOWORD) ; 

UNTIL  t ICGUNT5N01TERATI0NS)0RtCHNGEFLAG5FALSE)  ; 
END  <  *  I F» ) ; 

SEEK (WFILE, NO WORD) ; 

GET(UFILE) ; 

GET(UFILE) ; 

NOUORD:5NOWORD+l; 

END ( *WH I LE* ) ; 

CLOSE(STATSOUT); 

END(*PROCEDURE  DETPHALL*); 


BEGIN  (*AUTOMAT») 

OPERATION:5' . ' ; 

WRITELN; 

WRITELN; 

WR1TELN( 'CHOOSE  OPTION  FROM  FOLLOWING  LIST')* 

WRITELN; 

WRITELNt '0  -  EXECUTE  UP  TO  ' ,HAXNOPTIONS, '  CONSECUTIVE  OPTIONS.') 
WRITELNt ‘ 1  -  STORE  SPEECH  FILES.'); 

WRITELNC2  *  AUTOMATIC  WORD  RECOGNITION  FOR  MULTIPLE  WORDS.'); 
WRITELNt ' 3  -  AUTOMATIC  WORD  RECOGNITION  FOR  A  GIVEN  WORD.'); 
WRITELNt ' 4  -  INITIALIZE  MULTIPLE  WORD  STATISTICS.'); 

WRITELNt '5  -  DETERMINE  MULTIPLE  WORD  PHONEME  REPRESENTATIONS.'); 
WRITELNt' A  -  OPTIMIZE  THE  FUZZY  VARIABLES.'); 

WRITELNt '7  *  CHANGE  THE  OPTIMUM  FUZZY  VARIABLES.'); 

WRITELN; 

WRITEt ' OPTION  5  '); 

READLNt AOPTIONC1I ) ; 

WRITELN; 

NOPTIOMSs  =  1 ; 

IF  A0PT10N113  5  0  THEN 
BEGINt*IF*) 

REPEAT 

WRITELNt 'ENTER  OPTION  « ' ,NOPTIONS , ' ;  OR  ENTER  0,'); 

WRITEt ' OPTION!'  , NOPTIONS, '3  5  '); 

READLN (AOPTI ON [NOPTIONS 3 ) ; 

NOPTIONS  s  =N0PTI0NS+1 ; 

UNTIL  ( (AOPTIQN [NOPTIONS- 1350)OR( NOPTIONS5 (HAXNOPT I ONS+1 ) ) ) ; 
NOPTIONS: 5 NOPT IONS- 1; 

INITFILE; 

END ( *  I F» ) ; 

FOR  I s  =  1  TO  NOPTIONS  DO 
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BEGIN ( »FOR*  > 

IF  AOPTIONCI ]  =1  THEN 
BEGINOIF*) 

STORESPEECH ; 

END  t  *IF* ) 

ELSE  IF  AOPTIONCI]  =  2  THEN 
BEG1NUELSE*) 

RECOGALL; 

END ( *ELSE* ) 

ELSE  IF  AOPTIONCI!  *  3  THEN 
BEGIN ( *ELSE* ) 

RECOGUORD; 

ENDUELSE*) 

ELSE  IF  AOPTIONCI!  =  4  THEN 
BEGINUELSE*) 

INI  TALL ; 

END ( *ELSE* ) 

ELSE  IF  AOPTIONCI]  =  5  THEN 
BEGINOELSE*) 

DETPHALL; 

END ( «ELSE* ) 

ELSE  IF  AOPTIONCI]  =  6  THEN 
BEGINOELSE*) 

IF  NOPTIONSsl  THEN 
BEGINOIF*) 

INITFILE; 

ENDOIF*) ; 

GETOPUORDS; 

FUZZYOPT(NQPTIQNS) ; 

ENDUELSE*) 

ELSE  IF  AOPTIONCI]  =  7  THEN 
BEGINUELSE*) 

INI TF I LE ; 

CHNGEOPT; 

ENDOELSE*) 

ELSE 

BEGINUELSE*) 

URITELN; 

WRITELN ( 'OPTION  DOES  NOT  EXIST’) 
URITELN; 

ENDOELSE*) ; 

CLOSEtOPTFUZZY ) ; 

CLOSE(FZOPT); 

CLOSE ( UFILE) ; 

CLOSE ( SF I LE ) ; 

CLOSE ( SPECH ) ; 

ENDOFORx); 


END  (^PROCEDURE  AUTOHAT*): 


{XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 


PROGRAM  OUTSTAT  DESCRIPTION 


THIS  PROGRAM  OUTPUTS  ALL  THE  INFORMATION  STORED  IN  FILES 
SFILE  AND  WFILE  FOR  THE  SPECIFIED  WORD  (OR  WORDS).  THE 
FOLLOWING  TWO  OPTIONS  ARE  PROVIDED: 

1.  OUTPUT  THE  DATA  STORED  IN  FILES  SFILE  AND  WFILE 
FOR  A  GIVEN  WORD  TO  THE  FILE  INDICATED  BY  THE 
USER. 

2.  OUTPUT  THE  DATA  STORED  IN  FILES  SFILE  AND  WFILE 
FOR  ALL  THE  WORDS  IN  THE  VOCABULARY,  AND  GIVE 
EACH  OUTPUT  FILE  7HE  NAME  DERIVED  FROM  THE 
WORD'S  SPELLING  PRECEEDED  BY  AN  “S".  FOR  EXAMPLE, 
THE  OUTPUT  FILE  NAME  FOR  THE  WORD  ZERO  WOULD  BE 
"SZERO". 


THIS  PROGRAM  ALSO  STORES  THE  OVERALL  FUZZY  VARIABLES,  AND  THE 
WORD'S  FUZZY  VARIABLES  IN  EACH  OUTPUT  FILE. 


FOR  A  DESCRIPTION  OF  THE  DATA  STRUCTURES,  CONSTANTS,  AND 
VARIABLES,  REFER  TO  THE  DOCUMENTATION  AT  THE  BEGINNING  OF  PROGRAM 
LEARNWORD. 


XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKXXXXXXXXXKXXXXXXXXXXXXX} 


PROGRAM  OUTSTAT( INPUT, OUTPUT, SPECH, WFILE, SFILE, 

STATSOUT, FUZZY  VAR, QPTFUZZY  ,FZQPT ) ; 


CONST 

SPELLENCTH=20; 

HAXPH0NESs20; 

HAXPHPLUS1=21; 

NOPHONEMES-71? 

NOGUESSES-5; 

HAXN0W0RDSs50; 

RCRDLENGTHMO  < RCRDLEHGTH  MUST  BE  A  MULTIPLE  OF  TWENTY); 

NAXVECT0RS=80; 

NOTOAVG:5; 

BEG INUORD  =  1 ; 

NOFUZZYVARS-21 ; 


UORDDATA- RECORD 

SPELLING sSTRIMGCSPELLEHGTH]; 

PHREPsARRAYC 1 . . rtAXPHONESl  OF  INTEGER; 
WSTHR,HSUBE,USUBF,WINSE:REAL; 

UINSF fWDELE  ,HBELF,UDELG:REAL; 

WDCNE,WDCNF ,UDCNG:REAL; 

USFE,WSFF,UCHVE,UCHVF:REAL; 

WSTATE,WSTATF,WSTATG:REAL; 

HTHR1E,WTHR1F,WTHR2E,WTHR2F:REAL; 

END  («UORDDATA*>; 

WORDSTATS'RECORD 

NOSPOKE > NO ATTEHPT >NOCORRECT: INTEGER; 

NDEL : ARRAY!  1  <  . NAXPHQNES]  OF  INTEGER; 
XSUBY,XINAFTZ:ARRAYC1,.P1AXPH0NES4**N0PHQNE«ES]  OF  REAL 
TOTALSCORE:R£AL; 

TOTDIFFsREAL; 
tllNDIFF  sREAL ; 

END  UUORDSTATS*); 

STATSFI L  =  FI LE  OF  UORDSTATS; 

ISTRING  =  RECORD 

SPELLsSTRINGtSPELLENGTHI; 

CDNTFLAGilNTEGER; 

SPKRsSTRINGLIO] ; 

I  SCORE: ARRAY [1 . .RCRDLENGTH!  OF  REAL; 

IPHONE : ARRAY! 1 « .RCRDLENGTH]  OF  INTEGER; 

IVERRORrARRAYCl . , RCRDLENGTH]  OF  STRINCt53; 
ICHPHONEsARRAYtl,. NOGUESSES,  1 RCRDLENGTH]  OF  INTEGER; 
ICHVALUE:ARRAYC1.. NOGUESSES, 1.. RCRDLENGTH]  OF  REAL; 
ISFACTORsARRAYtl. .RCRDLENGTH]  OF  REAL 
END  (»ISTRING«); 


HORDSTRING^FILE  OF  ISTRING; 
HORDFILE5  FILE  OF  WORDDATA; 


FUZZ YLI HITS 5  RECORD 

FUZNAHE:ARRAYC1. .NOFUZZYVARS3  OF  STR1NGC53 ; 
FUZVAR,HINLIHIT,HAXLII1IT:ARRAY[l..NOFUZZYVARS3  OF  REAL; 
RECSCOREjREAL; 

ENDUFUZZYLIHITS#) ; 

FUZOPT  =  F I LE  OF  FUZZYLIHITS; 


OPTION: INTEGER; 
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WDATAiUORDDATA; 

WSTATS.-WORDSTATS; 

SPILE  tSTATSFIL ; 

SPECH : WDRDSTRI NG ; 

WFILE  sWORDFILE ; 

I  OUT 1 1  ST  RING ; 

FZOPTtFUZOPT ; 
FLIHIT:FUZZYLIHITS; 
MORDiSTRINGtSPELLENGTH]; 
ENDFLAG ;BODLEAN ; 
STATSOUTtTEXT; 

FUZZYVARsTEXT ; 

OPTFUZZY  :TEXT ; 
SPEAKER:STRINGC3] ; 

FILENAME: STRING [221; 
STHR,SUBE,SUBF:REAL; 

INSE  ,INSF ;REAL ; 
DELE,DELF,DELG:REAL; 
DCNE,DCNF,DCNG:REAL; 
SFE,SFF:REAL; 

CHVE,CHVF:REAL; 

STATE, STATF,STATG:REAL; 
THRIE,THR1F ,THR2E ,THR2F  :REAL; 


{**»***#*#***»*»**##»*#*»#»**#*»**###**»***********«*******»*******#**## 


PROCEDURE  INITFILE  DESCRIPTION 


THIS  PROCEDURE  ACCEPTS  THE  THREE  INITIALS  PRECEEDING  THE 
FILES  TO  BE  OPERATED  UPON,  AND  CONCATINATES  THEN 
WITH  THE  APPROPRIATE  FILE  NAMES.  THE  FOLLOWING  FILES  ARE  AFFECTED: 


WFILE 

SFILE 

SPECH 

FZOPT 


*ff * »i»««**»*»****« «**»««** ft *ff *»«*«« »**»*«*«*** *««*«*« K«K »****»«»*»»« «*«**} 


PROCEDURE  INITFILE; 
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BEGIN  (*INITFILE«) 


HRITELN; 

HRITELN; 

UR1 TELN ( ' ENTER  “THREE”  INITIALS  FOR  THE  CURRENT  «PERMANENT*  '); 
WRITELNt 'SPEAKER;  OR  ENTER  "ALL"  FOR  •NON-PERMANENT#  SPEAKERS.1); 
HRITE( ' SPEAKER  =  '); 

READLN(SPEAKER) ; 

URITELN; 

F I LENAME:=CONC AT (SPEAKER ,  'HFILE' ) ; 

FILETITL£(WFILE, FILENAME); 

FILENAME :=CONCAT< SPEAKER  ,'SFILE'); 

FILETITLE<SFILE FILENAME); 

FILENAME s-CONCAT (SPEAKER, ' SPECH' ); 

FILETITLEtSPECH, FILENAME) ; 

FILENAME :-CONCAT( SPEAKER, 'FZOPT' ); 

FILETITLE(FZOPT, FILENAME) ; 

END(«PROCEDURE  INITFILE* ) ; 


(a******************************************************************* 


PROCEDURE  GETFUZZYVARS  DESCRIPTION 


THIS  PROCEDURE  READS  THE  VALUES  OF  THE  OVERALL  FUZZY  VARIABLES 
STORED  IN  THE  FILE  FUZZYVAR. 


»***»###»#»##*##**#»#*»#«##*#*####***###########«****»##***#####**####} 


PROCEDURE  GETFUZZYVARS; 
BEG1N(»GETFUZZYVARS») ; 

RESET(FUZZYVAR) ; 

READLN( FUZZYVAR, STHR,SUBE,SUBF) ; 
READLN( FUZZYVAR, INSE,INSF); 

READLN( FUZZYVAR , DELE ,DELF,DELC) ; 
READLN( FUZZYVAR ,DCNE,DCNF  ,DCNG) ; 
KEADLN(FUZZYVAR  ,SFE  ,SFF) ; 

READLN( FUZZYVAR, CHVE,CHVF); 
READLN(FUZZYVAR,STATE,STATF  ,STATG) ; 
READLNt FUZZY VAR, THR IE ,THR1F) ; 


RE ADLN ( FUZZ Y VAR ,THR2E  ,THR2F) ; 


CIOSE(FUZZYVAR) ; 

END ( *PROCEDURE  GETFUZZYVARS*)  ? 


Ohhhhuhhhhhhhhhhhk##*##****#***************#*#*#*****#***#****##*** 


PROCEDURE  GETWORDVARS  DESCRIPTION 


THIS  PROCEDURE  INITIALIZES  THE  VALUES  OF  THE  GLOBAL  FUZZY 
VARIABLES  TO  CORRESPOND  TO  A  WORD'S  FUZZY  VARIABLES.  RECORD 
HDATA  RUST  CONTAIN  THE  APPROPRIATE  WORD'S  DATA  BEFORE  THIS 
PROCEDURE  IS  CALLED. 


«*»*»ft«»****»«««**«****««««««*X»«««««*»*»»«**«««**«******«**K««»»«»ft««} 


PROCEDURE  GETWORDVARS; 

BEG1NOGETUQRDVARS*) 

WITH  UDATA  DO 
BEGINUWITH*) 
STHR:=WSTHR; 

SUBE :  =  USUBE ; 

SUBF ;  =  WSUBF  ; 

INSE; - WINSE; 

INSF  5  = W I NSF ; 

DELE; -WDELE; 

DELF  s  =  WDELF ; 
DELC:=WDELG; 
DCNE:-UDCNE; 

DCNF i = WDCNF ; 

DCNG :  =UDCNG ; 

SFE :  =  WSFE ; 

SFF  s  =  WSFF ; 
CHVE:-WCHVE; 

CHVF :  =  WCHVF ; 
STATE:*WSTATE; 
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STATFs=WSTATF; 

STATQs  =  WSTATG ; 
THR1E:=UTHR1E; 

THRlFs  =  WTHR1F ; 

THR2E  s  =  WTHR2E; 
THR2F;=WTHR2F; 

END ( *WITH* ) ; 

ENDOFROCEDURE  GETUORDVARS*) ; 


{a**#****#*#*####*#*#**##*******#**###***########**####*##*#**###* 


PROCEDURE  AUTOOUT  DESCRIPTION 


THIS  PROCEDURE  OUTPUTS  THE  INFORMATION  STORED  IN  SF1LE  AND 
UFILE,  AND  THE  VALUES  OF  THE  OVERALL  AND  WORD  FUZZY  VARIABLES 
FOR  THE  WORD  SPECIFIED  BY  THE  VARIABLE  UORDNO. 


•IK**###*#**#*##*########***#***###*#*#####**###**###***#***#####*} 


PROCEDURE  AUTOOUT(UORDNQ:INTEGER) ; 

VAR 

AVESCORE:REAL; 

I  ,J:INTEGER; 

BEGIN(*AUTOOUT») 

SEEK(HFILE, UORDNO); 

GET(UFILE) ; 

WDATA  s  =  UFILEA ; 

HRITELN(STATSOUT) ; 

HRITELN(STATSOUT , ' THE  STATISTICS  FOR  ' ,WDATA. SPELLING, '  FOLLOW.') 
URITELN(STATSOUT) ; 

MRITELN( STATSOUT , ' THE  SPEAKER  IS  SPEAKER, '"') ; 
URITELN(STATSOUT) ; 

SEEKCSFILE, UORDNO) ; 

GET(SFILE) ; 

VSTATS: sSFILEA ; 

WITH  USTATS  DO 
BEGIN  <«UITH*> 
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WRITELN(STATSOUT,'#  OF  TINES  SPOKEN  =  ',NOSPOKE); 

WRITELN ( STATSOUT , ' «  OF  RECOGNITION  ATTEMPTS  =  ' ,NOATTEHPT) ; 
WRITELN(STATSOUT , ' #  OF  TINES  CORRECTLY  RECOGNIZED  =  ' ,NOCORRECT) ; 

IF  NOATTEHPTOO  THEN 
BEGIN  <*IF#) 

AVESCOREt=TOTALSCGRE/NGATTEHPT; 

WRITELN (ST ATSOUT , ' AVERAGE  RECOGNITION  SCORE  *  ' ,AVESCQREi8:3) ; 

END  (#IF#) 

ELSE 

BEGIN  <»ELSE») 

AVESC0RE:=0.0; 

WRITELNtSTATSOUT, 'AVERAGE  RECOGNITION  SCORE  =  ' ,AVESC0RE:4) ; 

END  (*ELSE«) ; 

WRITELN(STATSOUT) ; 

IF  NOCORRECTOO  THEN 
BEGIN  ( *  I F  * ) 

AVESCORE :=TOTDIFF/NQCORRECT; 

WR1TELN(STATS0UT,' AVERAGE  DIFFERENCE  BETWEEN  WORD  SCORE'); 

WRITELN (ST ATSOUT ,' AND  NEXT  HIGHEST  SCORE  -  ' ,AVESC0REi8:3) ; 

END  (*IF») 

ELSE 

BEGIN  (*ELSE») 

AVESC0RE:=0.0; 

WRITELN(STATSOUT, 'AVERAGE  DIFFERENCE  BETWEEN  WORD  SCORE'); 
WRITELN(STATSOUT ,' AND  NEXT  HIGHEST  SCORE  *  * ,AVESC0REi4:2) ; 

END  (»ELSE«); 

WRITELN(STATSOUT); 

WRITELN (ST ATSOUT »'NINIHUH  DIFFERENCE  BETWEEN  WORD  SCORE'); 

WRITELN (ST ATSOUT ,  ’ AND  NEXT  HIGHEST  SCORE  =  ',HINDIFF); 
WRITELN(STATSOUT) ; 

GETFUZZYVARS; 

WRITELN(STATSOUT); 

WRITELN(STATSOUT); 

WRITELN(STATSOUT ,'THE  OVERALL  FUZZY  VARIABLES  THAT  WERE  USED  FOLLOW') 
WRITELN(STATSOUT) ; 


WRITELN (ST ATSOUT , ' STHR  = 

'  ,STHR , ' 

SUBE  = 

' , SUBE , '  SUBF  =  ' 

,SUBF) ; 

WRITELN(STATSOUT,‘INSE  = 

*  ,INSE,' 

1NSF  = 

’ ,INSF) ; 

WRITELN (ST ATSOUT, 'DELE  = 

'  , DELE  , ' 

DELF  = 

' , DELF , '  DELC  =  ' 

, DEL C ) ; 

WRITELN (ST ATSOUT , ' DCNE  = 

'  ,DCNE , ' 

DCNF  = 

' ,DCNF  DCNG  =  * 

,DCNG) ; 

WRITELN (ST ATSOUT , '  SFE  = 

',SFE,' 

SFF  -  ' 

, SF  F ) ; 

WRITELN (ST ATSOUT , ' CHVE  = 

'  ,CHVE , ' 

CHVF  = 

' ,CHVF) ; 

WRITELN (ST ATS OUT ,' STATE = 

' , STATE,' 

STATF= 

' ,STATF , '  STATG  = 

' ,STATG 

WRITELN(STATS0UT,‘THR1E-- 

' ,  THR 1 E » ' 

THR  1 F  = 

’ , THR 1 F ) ; 

WRITELN( ST ATSOUT , ' THR2E5 

' , THR2E ,' 

7HR2F  = 

' , THR2F ) ; 

WRITELN(STATSOUT) ; 

WRITELN(STATSOUT) ; 

GETWORDVARS; 

WRITELN(STATSOUT) ; 

WRITELN(STATSOUT) ; 

WRITELN(STATSOUT,'THE  WORD  FUZZY  VARIABLES  FOLLOW'); 

WRITELN(STATSOUT) ; 

WRITELN (ST ATSOUT  ,'WSTHR  =  '  ,STHR , '  WSUBE  -  ',SUBE,'  WSUBF  -  ',SUBF) 
WRITELN(STATSGUT,'WINSE  =  ',INSE,'  WINSF  =  ',INSF>; 

WRITELN (ST ATSOUT , ' WDELE  *  '  ,DELE  , '  WDELF  =  '  ,DELF , '  WDELG  =  ' ,DELG) 
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WRITELN ( STATSOUT , '  UDCNE  =  ',DCNE,'  WDCNF  =  ' >DCNF  , '  WDCNG  =  ‘  ,DCNG> 
WRITELN<STATSOUT , ' USFE  =  ',SFE,'  WSFF  =  ',SFF>? 

WRITELN (STATSOUT , ' WCHVE  =  ',CHVE,'  WCHVF  =  *,CHVF)? 

WRITELN (STATSOUT , ' WSTATE1  STATE,'  USTATF=  ' ,STATF ,'  WST ATG  =  ' >STATG) ? 
WRITELN(STATS0UT,’UTHR1E=  ‘  ,THR1E ,  ‘  WTHR1F=  ',THR1F)? 
WRITELN(STATS0UT,'WTHR2E=  ',THR2E,'  WTHR2F5  ',THR2F>? 
WRITELN(STATSOUT) ? 

WRITELN(STATSOUT)? 

WRITELN(STATSOUT) ; 

WRITELN (STATSOUT , ' «  TIMES  EACH  WORD  PHONEME  IS  DELETED')? 

WRITELN(  STATSOUT , ' . '  )? 

URITE(STATSOUT, 'PHONEME  REP  =  ')? 

FOR  I:«  1  TO  10  DO 
BEGIN  ( *FOR* ) 

WRITE ( STATSOUT >WDATA«PHREPC1 3 :5) ? 

END ( *FOR* ) ? 

WRITELN(STATSOUT) ? 

WRITE(STATSOUT , ' #  DELETED  :  *)? 

FOR  1 5*1  TO  10  DO 
BEGIN  (#FOR*) 

WRITE (STATSOUT  ,NDELCI3:5)? 

END  ( *FOR * ) ? 

WRITELN(STATSOUT); 

WRITELN(STATSOUT); 

WKITE(STATSOUT, 'PHONEME  REP  -  ')? 

FOR  If-  11  TO  20  DO 
BEGIN  <*FOR») 

WRITE ( STATSOUT >WDATA*PHREPC1 3 :5)  ? 

END ( »FOR* ) ? 

WRITELN (STATSOUT ) ? 

WRITE(STATSOUT,‘#  DELETED  =  ’)? 

FOR  1**11  TO  20  DO 
BEGIN  (*FOR«> 

WRITE< STATSOUT, NDELCI3 55)? 

END  (*F0R»)? 

WRITELN(STATSOUT)  ? 

WRITELN(STATSOUT) ; 

WRITELN(STATSOUT) ? 

WRITELN(STATSOUT)  ? 

WRITELN (STATSOUT , ' «  TIMES  PHONEME  X  SUBSTITUTED  FOR  WORD  PHONE')? 

WRITELN(STATSOUT , ' . 1  )? 

WRITELN(STATSOUT)  ? 

WRITELN(STATSOUT,‘  ':40,'W0RD  PHONEMES')? 

WRITE(STATSOi:T,'  '  :4 , ' X' , '  's5)? 

FOR  I:=l  TO  10  DO 
BEGIN  <«FOR») 

WRITE(STATSOUT ,WDATA*PHREPtI3 :8) ? 

END  (#FOR»)  ? 

WRITELN ( STATSOUT  >  ? 

WRITELN( STATSOUT , ’ , '  :60)? 

WRITELN (STATSOUT )  ? 

FOR  J : =  1  TO  NOPHONEMES  DO 
BEGIN  (*FOR») 

WRITE (STATSOUT ,  J:5 , '  ';5); 
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FOR  1 5  =  1  TO  10  DO 
BEGIN  <«FOR*> 

HRITE(STATS0UT,XSUBY[I,J]:8s3)*, 

END  (#FQR») ; 

WRITELN(STATSOUT); 

END  (»FOR#) ; 

WRITELN ISTATSOUT  > ; 

WRITELN( STATSOUT) ; 

IF  WDATA.PHREPCinOOO  THEN 
BEGIN  (#IF*) 

WRITELN (STATSOUT , '  '  ;40  , ' WORD  PHONEMES' ) ; 

WRITE( STATSOUT f '  ‘:5>; 

FOR  Ij-11  TO  20  DO 
BEGIN  («FOR«) 

WRITE ( ST ATSOUT ,W DATA. PHREPC 13:6) ; 

END  ( *FOR») ; 

WRITELN(STATSOUT) ; 

WRITELN (STATSOUT > '  . '  i*0); 

WRITELN(STATSOUT)  ; 

FOR  Ji=  1  TO  NOPHONEMES  DO 
BEGIN  <*FOR») 

WRITE (STATSOUT , J : 5 # '  ':5>; 

FOR  1 s  =  1 1  TO  20  DO 
BEGIN  ( #FOR*) 

WRITE (STATSOUT ,XSUBYC1 ,J3:S:3) ; 

END  <«FOR*); 

WRITELN ( STATSOUT ) ; 

END  ( *FOR* ) ; 

END(#IF») 5 
WRITELN(STATSOUT) ; 

WRITELN ( STATSOUT ) ; 

WRITELN  (STATSOUT t '  »  TIMES  PHONEME  X,  NSERTED  FOR  WORD  PHONE 

WRITELN(STATSOUT  . 

WRITELN(STATSQUT )  ; 

WRITELN(STATSOUT ' s40,'W0RD  PHONEMES'); 

WRITE(STATSOUT,'  ':4,'X','  ‘:5); 

FOR  1**1  TO  10  DO 
BEGIN  (*FOR#) 

WRITE  (STATSOUT, WDATA.PHKEPm :6) ; 

END  <  *  FOR  « ) ; 

WRITELN(STATSOUT); 

WRITELN(STATSOUT,' :80>; 

WRITELN < STATSOUT ) ; 

FOR  J : =  1  TO  NOPHONEMES  DO 
BEGIN  ( *  F  OR  * ) 

WRITE (STATSOUT :5> '  ':5); 

FOR  I s  =  1  TO  10  DO 
BEGIN  (*FOR») 

WRITE (STATSOUT, XINAFTZCI , J] :B:3) ; 

END  (*FOR»); 

WRITELN(STATSOUT) ; 

END  (»FOR«) ; 

WRITELN(STATSOUT) ; 

WRITELN ( STATSOUT  > ; 
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IF  WDATA.PHREPCllIOOO  THEN 
BEGINUIF*) 

WRITELN(STATSOUT,'  ':40,'W0RD  PHONEMES'); 

WRITE( STATSOUT f '  ':4,'X','  ':5); 

FOR  I s  =  11  TO  20  DO 
BEGIN  UFOR*) 

WRITE (STATSOUT ,WDATA«PHREPCI] :8) ; 

END  <«FOR»); 

WRITELN(STATSOUT)i 
WRITELN(STATSOUT,'  :60); 

WRITELN(STATSOUT) ; 

FOR  J:=  1  TO  NOPHQNEMES  DO 
BEGIN  <«FOR») 

WRITE (STATSOUT ,J  s5,  ’  *:5>; 

FOR  I :=11  TO  20  DO 
BEGIN  ( *FOR#) 

WRITE (STATS OUT ;XINAFTZC I, J3:fi:3); 

END  UFOR*); 

WRITELN(STATSOUT); 

END  <»FOR«); 

END  (*IF») ; 

WRITELNC THE  STATISTICS  FOR  '  ,WDATA. SPELLING,'  ARE  STORED'); 
WRITELNCIN  FILE  FILENAME, ‘  ".') ; 

ENDUWITH*); 

CLOSE(STATSOUT); 


END(«PROCEDURE  AUTOOUT*); 


{»****»»***»»**#»»***»*»**#*»»***#*X)H»*»****«**»#*»»******«*»*#»»»» 


PROCEDURE  OUTSTATS  DESCRIPTION 


THIS  PROCEDURE  ACCEPTS  THE  NAME  OF  THE  WORD  WHOSE  INFORMATION 
IS  TO  BE  OUTPUT  AND  THE  NAME  OF  THE  OUTPUT  FILE,  AND  THEN  CALLS 
PROCEDURE  AUTOOUT. 


PROCEDURE  OUTSTATS; 


1,J:1NTEGER; 

UORD;STRINGCSPELLENGTH3 ; 
TEHPUORD ; STR I NGt SPELL ENG TH I ; 
UORDNO:INTEGER; 

AVESCORE :REAL ; 
ANSUER:STRINGL3] ; 


BEGIN  UOUTSTATS*) 

INITFILE; 

RESET ( WFI LE ) ? 

URITELN ( ' INPUT  NAME  OF  OUTPUT  FILE'); 

WRITEC  FILENAME  *  '); 

READLN(FILENAME) ; 

URITELN; 

FILETITLEt STATSOUT, FILENAME  ); 

REURITE(STATSOUT) ; 

RESET(SFILE); 

U0RDN0;--0; 

SEEK(UFILE,UORDNO); 

GET(UFILE) ; 

URITELN; 

URITELN  ( '  THE  FOLIOk'ING  IS  THE  CURRENT  VOCABULARY') 
URITELN; 

UHILE  NOT  EOF(UFILE)  DO 
BEGIN  UWHILE*) 

UDATA:sUFILE* ; 

URITELNtWDATA. SPELLING); 

GET(UFILE) ; 

END  UUHILE*); 

URITELN; 

URITELN; 

WR I TELN ( ' INPUT  WORD  TO  OUTPUT  STATISTICS'); 
URITELN; 

URITE  ( ' WORD  =  '); 

READLN  (UORD ) ; 

URITELN  ; 

FOR  I s  =  (LENGTH(HORD) )  TO  < SPELLEHGTH- 1 )  DO 
BEGINUFQR*) 

UORDs  =  CONCAT  < WORD, '  '); 

END(»FOR») ; 

RESET  (UF1LE) ; 

TEHPUORD:- '  '; 

UORDNO: :0 ; 

UHILE  (NOT  EOF(UFILE) ) AND ( UORD < ) TEHPUORD)  DO 
BEGIN  <«UHILE«> 

SEEK ( WFILE , UORDNO ) ; 

GET(UFILE) ; 


263 


UDAT A s  =  UFILE* ; 

GET ( WF I LE > ; 

TEMP WORD : 5 UD AT A. SPELLING; 
WORDNO:=UQRDNO+  1 
END  UWHILE*); 

IF  (WQRDOTEHPWORD)  THEN 
BEGIN  ( *  I F  * ) 

WRITELN; 

WRITELN< ' WORD  DOES  NOT  EXIST.'); 
END  (*IF») 

ELSE 

BEGIN ( *ELSE* ) 

WORDNO:=WORDNO- 1 ; 

AUTOOUT(UORDNO) ; 

ENDUELSE*) ; 


END  ^PROCEDURE  OUTSTATS*); 


{if**##*###********###****#****#*#####*#***#*****#*##***##**#*##**### 

PROCEDURE  ALLOUT  DESCRIPTION 


THIS  PROCEDURE  OUTPUTS  THE  INFORMATION  FOR  EVERT  WORD  IN  THE 
CURRENT  VOCABULARY  TO  FILES  UHOSE  NAMES  ARE  GIVEN  BY  THE  CONCAT¬ 
ENATION  OF  THE  CHARACTER  "S"  AND  THE  SPELLING  OF  THE  WORD. 


#*#*»****#**########***##### »«**«** *«****»*#***#**«***#*»**»#**###»*} 


PROCEDURE  ALLOUT; 

VAR 

UORDNOsINTEGER ; 

BEGIN ( «ALL0UT« ) 

INITFILE; 

RESET(SFILE) ; 

UORDNO :  =  0 ; 

RESET(WFILE); 

WHILE  NOT  EOF(WFILE)  DO 
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BEGINUUHILE*) 

SEEK (UFILE ^WORDNO) ; 

GET(WFILE) ; 

UDATAs  =  WFILEA ; 

FILENAHE: sCONCAT( ' S‘ , WDATA. SPELLING) ; 
FILETITLEtSTATSOUT, FILENAME); 

REWRITE ( STATSOUT ) ; 

AUTOOUT (WORDNO)  ; 

SEEK ( WF HE  , WORDNO) ; 

GET(WFILE) ; 

GET(WFILE) ; 

WORDNO : =WORDNO+l ; 

END ( *WHILE* ) ; 

ENDUPROCEDURE  ALLOUT*); 


BEGIN! *PROGRAH  OUTSTAT*) 


URITELN; 

URITELN! 'CHOOSE  OPTION  FROM  FOLLOWING  LIST*)? 
URITELN; 

HRITELNC1  -  OUTPUT  STATISTICS  FOR  A  GIVEN  WORD.') 
URITELN! '2  -  OUTPUT  STATISTICS  FOR  ALL  WORDS.')? 
URITELN; 

WRITE! ' OPTION  =  '); 

READLN(OPTION) ; 

URITELN; 

IF  OPTION  *  1  THEN 
BEGIN! »IF») 

OUTSTATS; 

END ( *IF* ) 

ELSE  IF  OPTION  *  2  THEN 
BEGINUELSE*) 

ALLOUT; 

END ( *ELSE* ) 

ELSE 

BEGINUELSE*) 

URITELN! ' OPTION  DOES  NOT  EXIST'); 

ENDUELSE*) ; 


END  UPROGRAH  OUTSTAT* ) 


C  ***************************** a***##*#****###****##**#***** 

C  *»»»#»**#*#***X*****»»#**X*#»***####IHHt**##»#*##***###*#»# 


C 

«* 

** 

c 

*« 

PROGRAM:  P  H  D  I  S  T 

«« 

c 

** 

** 

c 

** 

MRITTEN  BY :LT  KARL  SEELANDT 

«x 

c 

*« 

«* 

c 

** 

DATE-.19  NOV  1981 

«« 

c 

«x 

*# 

c 

*K 

IK 

c 

•« 

REVISED  BY:  LT  JERRY  MONTGOMERY 

** 

c 

** 

** 

c 

** 

DATE:  17  APRIL  1982 

** 

C 

*« 

.  ** 

C  *******#*#*##****#*#*************##*******####***#*******# 

C  »#tHHHt**»#*****#**##*K*###*##**#**#***X*##*#«*#***###»#*## 

c 

c 

C  NOTE:  THIS  ROUTINE  IS  FOR  FORTRAN  5 

C 

C 

C******  Tis  program  is  a  modification  of  a  program 
C  mritten  by  Lt.  Karl  Seelandt  here  at  AFIT.  The  purpose 
C  of  this  program  is  to  determine  the  distances  between 
C  spectrographic  frequency  components  of  an  input 
C  speech  sentence  and  a  set  of  prototype  phoneme 
C  templates.  The  prototypes  are  stored 
C  in  DISK3  and  the  sentence  is  read  from  DSPEECH. 

C  The  routine  must  be  set  up  to  manipulate 

C  the  digital  speech  in  the  same  exact  manner 
C  that  the  prototype  phonemes  mere  formed  since  it 
C  creates  the  spectrogram  of  the  speech  before 
C.  actually  doing  the  distance  measurement. 

C 

C*«*m**  This  program  is  set  up  to  do  any  of  the 
C  manipulations  that  the  program  SGRAPI2  can  do. 

C  This  allows  the  study  of  various  manipulations 
C  and  their  effect  on  the  recognition  accuracy. 

C 

C 

C  INITIALIZE  ARRAYS 
C 

DIMENSION  ISENT ( 512,32)  , IPHONE ( 16,32)  ,RESULT(512)  ,B(84)  ,IBIU4) 
DIMENSION  1DSP(512),RHINMAX0>12,2) 

COMPLEX  DTARAY (128) 

INTEGER  DSPEECH (13) 

C 

C  INITIALIZE  CONTROL  VARIABLES  FOR  STANDARD  OPERATION. 

C 

IPRE-1  ;FLAG  TO  DO  PREHPHASIS 

I DB  =  6  ;NUHBER  OF  DB'S  PER  OCTAVE  PREEMPHASIS 

FREQ  =  500 . 0 ;ST ART  INC  FREQUENCY  FOR  PREEMPHASIS 

I  PRESS  =  2  ;  F  L  A  C  TO  NOT  DO  FREQUENCY  COMPONENT  COMPRESSION 

ISIZE=64  ;NUHBER  OF  SAMPLES  PER  DFT 
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ILENGTH=5iLENGTH  OF  PHONEMES  IN  TIME  INCREMENTS 
NUMPP  =  39  ;NUMBER  OF  PROTOTYPE  PHONEMES 
SFREQ=8000 . 0  ^SAMPLING  FREQUENCY 

INITIALIZE  MIN-MAX  VARIABLES  USED  TO  KEEP  TRACK  OF  THE  MINIMUM 
; AND  MAXIMUM  DISTANCES  DETERMINED  FOR  EACH  TIME  FRAME. 

DO  500  1=1,512 

RH INMAX (I ,1 >=100000000.0  ;MAX  POSSIBLE  DISTANCE  FOR  MINIMUM  VARIABLE 
RHINHAXd  ,2)=0.0  ;MIN  POSSIBLE  DISTANCE  FOR  MAXIMUM  VARIABLE. 

500  CONTINUE 

INPUT  CONTROL  VARIABLES  AND  OPEN  FILES. 


ACCEPT“ENTER ,  name  of  speechfile  to  be  recognized:" 

READ( 11 ,510)DSPEECH( 1 ) 

510  F0RMAT(S13) 

OPEN  3,“RESTEMF'“  ; 

CALL  0PEN(5,DSPEECH,1,512,IER1)  ;speechfile  data 
CALL  CHECK  (IER1) 

ACCEPT  "FIRST  BLOCK  TO  READ'-"  ,ISTART 
ACCEPT  “LAST  BLOCK  TO  READ=“,1LAST 

I SLEN  =  < (1LAST  +  1 )*25A)/ISIZE  ^ LENGTH  OF  SENTENCE ( VECTORS > 

ISIZEH= ISIZE/2  ^NUMBER  OF  COMPONENTS  PER  VECTOR 

ISAMP= ( (256 .0/1  SIZE) #1  START >  +  1  ^NUMBER  OF  1ST  TIME  INCREMENT  USED. 
ISANP1=ISAMP  ;SAVE  THIS  INITIAL  VALUE  FOR  LATER 
DELETE  “DCOMPS" 

OPEN  4, “DCOMPS", LEN=ISIZE 

JSIZE= ISIZE*ILENGTH+2  ; RECORD  SIZE  OF  PHONEME  FILE 


IHAM=1 

NUHPP=71 

ENTHRES=60 


OPEN  2,"DISK3HAM",LEN=JSIZE 


CALL  FCT I HE (I  HOUR, I  MIN,  I  SEC)  ?GET  STARTING  TIME 


5  SECTION  OF  PROGRAM  TO  FORM  SENTENCE  ARRAY, 


DO  1000  IN=ISTART,I LAST 

CALL  RDBLK(5 ,IH,IDSP ,  1 , 1 E R )  g REP D  IN  DIGITIZED  SPEECH 
IF(IER.NE.l)  TYPE  "ERROR  ON  RDBLK  ,IER="  ,IER 
J1  =  0  ^OFFSET  USED  TO  SPLIT  UP  BLOCK  OF  SPEECH  DATA 
600  CONTINUE 

;SET  UP  COMPLEX  ARRAY  FOR  DFT 

;And  iff  requested  use  aHaaiiinq  window  on  the 
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{samples  as  shown  in  Oppenhien  and  Shafer's 

digital  Signal  Processing  (p.  242). 

* 

DO  410  J  =  1 , ISIZE 
8(J)=IBSP(J+J1) 

C 

UNDO  =  0.54  -0.46  *  C0S(  2.0  * 

/  3.1415926534  «  (J-D/1SI2E  > 

C 

B(J)  -  UNDO  *  B(J)  {window  dspeech  points 

C 

DTARAY (J) =B(J) 

610  CONTINUE 

LARAY=ISIZE  ;#  OF  POINTS  FOR  DFT 
INVER=0  ^SPECIFIES  A  FORWARD  TRANSFORM 
CALL  DFT5(DTARAY,LARAY, INVER) 

{Must  be  sure  that  the  Magnitudes  of  the 
{two  differently  windowed  components  are 
;are  equivalent;  do  this  be  increasing 
{Magnitude  of  Hanning  windowed  components 
{assuming  Magnitude  of  Rect.  windowed 

{components  =  1. 

% 

{FIND  MAGNITUDE  OF  FREQUENCY  COMPONENTS 
DO  625  1=1 jISIZEH 
B(  I ) =CABS(DTARAY  < I ) /LAR A Y ) 

IF  (IHAH.EQ.l)  B(I)  =  B(I)  /  0.54 

625  CONTINUE 

PREEMPHASIZE  HIGH  FREQUENCY  COMPONENTS. 

IF  (IPRE.NE.l)  GO  TO  45 

{FIND  1ST  COMPONENT  TO  START  PREEHPHASIS  AT 
IFREQ1=IFIX(FREQ/SFREQ#ISIZE)+1 
DO  700  1=1FREQ1,1SIZEH 

{CONVERT  INTEGER  VARIABLES  TO  REALS 
RI  =  I 

R1FREQ1=IFREQ1 

RIDB=IDB 

{MODIFY  MAGNITUDE  OF  COMPONENTS 
B(I)=BU)M10**(RIDB/20»ALOG10<RI/RIFREQ1)/ALOGIO(2,0>)> 

700  CONTINUE 

FREQUENCY  VECTOR  COMPRESSED  TO  16  CHANNELS  IF  COMPRESSION  WAS  SELECTED. 

45  IF  < I PRESS • NE . 1 )  GO  TO  30 

.{FIND  HUMBER  OF  COMPONENTS  TO  COMBINE  INTO  1  COMPONENT 
JNUM=ISIZEH/14 
J  J 1  =  1 
JJ2=JNUH 
DO  20  1=1,16 
SUM=0.0 
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DO  10  J=JJ1,JJ2 
10  SUH=SUM+B(J) 

B(I)=SUH 

jji=jji+jnum 

J J2s J J2+ JNUM 
20  CONTINUE 

NORMALIZATION  OF  ENERGY  IN  SIGNAL 

30  IF  (IPRESS.EQ.l)  ISIZEH=16 
SUME*0,0 

DO  33  J  =  1 , ISIZEH 
SUME=SUME+(B(J)*»2> 

33  CONTINUE 
ENERGY=SQRT(SUHE) 

IF  (ENERGY. GT.ENTHRES)  GO  TO  32 

^INCREASE  VALUE  OF  ENERGY  TO  ATTENUATE  COMPONENTS 
ENERGY=ENTHRES*5 
32  CONTINUE 

IMULT=32760  ;RANGE  OF  CONPONENTS  AFTER  NORMALIZATION* 

DO  34  J  =  l, ISIZEH 
B(J)MB(J)/ENERGY)»IMULT 

34  CONTINUE 

ARRAY  VALUES  CONVERTED  TO  INTEGER  FORM 

DO  240  JJ=1, ISIZEH 
IBI(JJ)=IFIX(B( JJ) )  + 1 
240  CONTINUE 

;SAVE  COMPONENTS  INCASE  NEEDED  LATER  BY  *PPGEN‘ 

IREC= ISAHP  ^CORRECT  RECORD  TO  WRITE  TO  IN  ‘DCOMPS' 

CALL  WRITR(4,IREC,IBI,1,IER21) 

CALL  CHECK  (IER21) 

FORM  SENTENCE  ARRAY 

DO  215  1  =  1, ISIZEH 
I  SENT (IS AMP ,1 ) *181 ( I ) 

215  CONTINUE 

ISIZEH=  ISIZE/2  ‘.RESTORE  VALUE  IN  CASE  ALTERED, 

ISAMP=1SAMP+1 
J1 = Jl+ISIZE 

IFIJl.LT. 256)  GOTO  600  ;LOOP  IF  MORE  VECTORS  POSSIBLE  FROM  BLOCK 
1000  CONTINUE 

CLOSE  4  ;CI ose  the  DCOMPS  file,- 

READ  IN  PROTOTYPE  PHONEMES  AND  DETERMINE  RECOGNITION  RESULTS, 

IF( IPRESS.EQ. 1 )  ISIZEH1 16 

;LOOP  TO  SELECT  1  PROTOTYPE  AT  A  TIME 
DO  900  I PP  =  1  ,NUi1PP 

IREC=IPP  ^RECORD  TO  READ  FROM  DISK3 
CALL  READRW(2,IREC,IDSP,1,IER7) 

IF  UER/.NE.l  HYPE"  ERROR  1ER7  =",IER7 
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CALL  CHECK  (IER7) 

J0FFSET=0  ;OFFSET  USED  TO  SETUP  THE  ARRAY  IPHONE 
DO  920  1=1 jILENGTH 
DO  910  J= 1 f ISIZEH 

IPHONEt I , J ) = IDSP ( J+ JOFFSET+1 )  ; ADD  1  TO  SKIP  VARIABLE  FOR  NUMBER  OF 

;TIHES  THF.  PHONEME  HAS  BEEN  MODIFIED. 

910  CONTINUE 

920  JOFFSET=JOFFSET+ ISIZEH 

C 

C  DETERMINE  DISTANCE. 

C 

RNAX  =  0«  0 

INOSHFT= ISAMP1 -1  ;MIN  SHIFT  REQUIRED  TO  GET  ALL  DISTANCES 
IMAXSHFT=ISLEN-ILENGTH-1  ;MAX  SHIFT  REQUIRED  TO  GET  ALL  DISTANCES. 

?L00P  THAT  GETS  ALL  DISTANCES  FOR  THIS  PROTOTYPE 
DO  080  ISHIFT=INOSHFT,IMAXSHFT 
RSUMDIS=0.0  ;USED  TO  SUM  UP  DISTANCE 

^CALCULATE  2-DIMENSIONAL  DISTANCE  FOR  THIS  TIME  FRAME 
DO  850  1=1 ,ILENGTH 
IS  =  I  +  ISHIFT  5 AMOUNT  TO  SHIFT 
DO  800  J  =  1  / ISIZEH 

800  RSUMDIS=RSUMDIS+IABS(1SENT( IS, J) -IPHONE! I,J>) 

850  CONTINUE 

;TEST  FOR  MAX-MIN  DISTANCES  FOR  THIS  TIME  FRAME 
IF(RSUNDIS.GT.RHAX)  RMAX=RSUMDIS 

IF(RSUMDIS,LT.RMINMAX<ISHIFT+1,1>>  RMINMAX( ISHIFT+1 ,1 >=RSUMDIS 
IFIRSUMDIS .GT , RMINMAXI ISHIFT+ 1 >2 ) )  RHlNMAXUSHIFT+l ,2) =RSUMDIS 
?SAVE  RESULT  FOR  THIS  TIME  FRAME 
RESULT ( ISHIFT+1 )=RSUMDIS 
C 

C  FIND  DISTANCE  BETWEEN  EACH  VECTOR  OF  EACH  SECTION  OF  SPEECH 
C  TO  CHECK  FOR  POSSIBLE  PHONEME-PHONEME  TRANSITIONS.  DO  ONLY  ON  1ST 
C  PASS  THRU  THIS  SECTION  OF  PROGRAM. 

C 

X  IF( IPP.NE. 1 ) GOTO  880  ?SKIP  THIS  IF  ALREADY  DONE  ON  1ST  PASS  THRU 
X  TRDIFF=0 ,0  ;USED  TO  SUM  UP  DIFFERENCE 
X  DO  870  J  =  l, ISIZEH 

X870  TRDIFF=TRDIFF+ I APS( ISENT ( ISHIFT+1 , J)-ISENT( ISHIFT+2,J) ) 

X  CALL  FSEEK(1,ISHIFT>  ^SELECT  CORRECT  RECORD 
X  WRITE  BINARY ( 1 )  TRDIFF  ;SAVE  TRANSITION  DISTANCE 
880  CONTINUE 
• 

;FILL  UP  BEGINNING  OF  TRANSITION  DISTANCE  FILE  WITH  A  LARGE  VALUE 

X  TRD I FF= 1000000000,0 
X  DO  885  1=0,IN0SHFT 
X  CALL  FSEEK (1,1) 

X885  WRITE  BINARY ( 1)  TRDIFF 

C 

C  FILL  UP  REMAINDER  OF  RESULT  ARRAY  W/  MAX  DISTANCE 
C 

I INC= IMAXSHFT+1 
DO  890  I=IINC,ISLEN 
890  RESULT(I)=RMAX 


r>  x  o  r>  o  o  r>  n  o 


WRITE  RESULTS  OUT  TO  TEMPORARY  FILE. 

DO  895  I  =  I  SAMP 1 , ISLEN 
895  WRITE  B1NARY(3)RESULT(I) 

TYPE  "PHONEMES  COMPLETED-  < 7  > < 7 > <  7 > ” r IPP 
900  CONTINUE  ;END  OF  LOOP  TO  CALCULATE  DISTANCE  FOR  ALL  PHONEMES. 

RECONFIGURE  RESULTS  TO  BE  COMPATIBLE  WITH  INTPLOT  AND  OUTPUT  TO  "CORRAR". 


CLOSE  1  ;transtion  file  closed. 

C**«**«  Store  a  scale  factor  for  each  tine  fraae.  This 
C  factor  will  give  a  neasure  of  ho*  accurate  a  choice 


C  was  wade  in  recognizing  phonenes  for  this  tine  period. 
C  This  scale  nil  1  be  used  by  the  CH0ICE5  routine. 

C 

C 

DELETE  “SFACTOR" 

OPEN  1,"SFACT0RV  ;scale  factor 

RMAXHAX:0 . 0 

DO  993  ISC=ISAHP1, ISLEN 


IF  (RHINHAX(1SC,1).GT. 90000000. 00)G0T0  993 
IF  (RMINHAX( ISC,1 ) .GT.RMAXMAX)  RMAXHAX=RMINMAX( ISC, 1 ) 
993  CONTINUE 

Pn  990  ISC=ISAMP1  , ISLEN  ; 

RHINHIN=RMINHAX( ISC ,1)/RHAXHAX 
WRITE  (1,493)  RMINMIN  ?store  'e», 

990  CONTINUE  ; 

IZERO  *  0.0  {fill  end  of 

DO  991  JZS1,4  ;file  with  0. 

WRITE  BINARY ( 1 )  IZERO  ; 

991  CONTINUE  ; 

CLOSE  1  ;Keep  channel  count  as  Ion  as  possible 

$or  the  progran  nil)  DIE. 


DELETE  "RFACTOR" 
OPEN  1, "RFACTOR' 


grange  factor 


REWIND  3 
DELETE  "CORRAR" 

CALL  CFILW ("CORRAR" ,2,IER99) 

IF ( I ER99 • NE ,1)TYPE" ERROR  on  CFILW  IER99*  ",IER99 


CALL  0PEN(4, "CORRAR", 2, 512, IER18) 
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non  ora  o  o  r?  o  n 


IF(IER18.NE, 1 ) TYPE "ERROR  IER18=“ # IHR1S 
C 

ISTBLK=0 

% 

?L00P  TO  OUTPUT  2  RECORDS  FOR  EACH  PHONEME 
00^2000  in  =  l /NUMPP 

;USE  HAX-MIN  DISTANCES  TO  SCALE  RESULTS  FOR  EACH  TIME  FRAME. 

;0  IS  ASSIGNED  TO  PHONEME  W/MAX  DISTANCE. 

;32767 ( MAX  INTEGER  SIZE)  IS  ASSIGNED  TO  PHONEME  H/HIN  DISTANCE. 
DO  1500  I = ISAHPl , ISLEN 
RF ACTOR =RHINM AX ( 1,2) -RMINMAXi I , 1 ) 

C 

C******  Store  the  weighting  factor  in  a  file  for  use  by 
CH0ICE5.  This  statistic  Rill  give  a  Measure  of  hoR 
■uch  of  a  variance  there  is  between  the  best  and  the 
Rorst  phoneee  natch  for  each  tiae  period. 


IFUM.GT.DGO  TO  1300  ;do  only  on  first 

;pass  thru. 

WRITE  (1,696)  RFACTOR 
693  FORMAT (F 11 .8) 

696  FORMAT ( FI 1 • 1 )  ^factor  foraat  stat. 

1300  CONTINUE 


READ  BINARY(3)  RESULT! I ) 

IDSP(I)=IFIX\ (RNINMAXd ,2)-RESULT(I))/RFACT0R*32767) 
1500  CONTINUE 

«  t 

;ZER0  FILL  REMAINDER  OF  RESULT  FILE 

;FILL  BEGINNING  OF  FILE  FOR  UNUSED  TIME  FRAMES, 

DO  1700  1  =  1  ,IN0SHFT 
I DSP ( I )  =  0 

1700  CONTINUE 

;F I LL  END  OF  FILE  FOR  UNUSED  TIME  FRAMES. 
IINC=ISLEN+1 
DO  1800  1  =  I INC  ,512 
1 DSP ( I ) =0 

1800  CONTINUE 

;OUTPUT  RESULTS. 

CALL  WRBLK < 4 , ISTBLX , IDSP ,2 , IER20 ) 
IF(IER20.NE.i)TYPE"ERR0R  IER20=" ,IER20 
CALL  CHECK ( IER20 ) 

ISTBLK--ISTBLK  +  2 
2000  CONTINUE 

CLOSE  FILES  AND  TERMINATE  PROGRAM, 


C 


CALL  FGTIME( IH0UR1 , I  MINI ,  I  SEC  1 ) 
WRITE  <10,99)IHQUR,IHIN,ISEC 
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WRITE  <10,98)IH0UR1,IHIN1,ISEC1 

99  FORMAT C‘  "  /‘STARTING  TIME  =  *,313) 

98  FORMATC  ” /‘FINISHED  TIME  =  *' , 3 1 3 ) 

TYPE  “ < 1 5  > ” 

TYPE“SENTENCE  LENGTH  IS  -  “,1SLEN 
TYPE‘‘NUMBER  OF  PHONEMES  =  ‘‘,NUHPP 

CALL  RESET 
DELETE  “RESTEMPl" 

WRITE  (10)  ‘‘<7><7><7><7><7>‘‘ 

STOP 


C  THIS  PROGRAM  TAKES  UP  TO 

C  UP  TO  75  PHONEMES . 

C 

C  .  •  •  .  •  <  .  i  < 

C  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

C  xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx* 


C 

XX 

XX 

c 

XX 

PROGRAM:  CHOICES 

XX 

c 

XX 

XX 

c 

XX 

WRITTEN  BY:  LT  KARL  SEELANDT 

XX 

c 

XX 

XX 

c 

XX 

DATE:  1  DEC  1981 

XX 

c 

XX 

XX 

c 

XX 

REVISED  BY:  JERRY  MONTGOMERY 

XX 

c 

XX 

XX 

c 

XX 

DATE:  15  APRIL 

XX 

C  XXXXXXXXXKXXXXXXXXKXKXXXXXXXXXXXXXXXXXXKXXXXXXXXXXKXKXXXXX 

c  xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

c 

c 

c 

cxxxxxx  NOTE:  THIS  ROUTINE  IS  FOR  FORTRAN  5  ONLY!!!! 

C 

C 

Cxxxxxx  This  program  is  used  with  the  program  PHDIST 
C  to  locate  and  tabulate  useful  information  about 
C  sound  units  is  speech.  The  routine  is  currently 
C  set  up  to  handle  a  maximum  of  7i  phonemes. 

C  The  files  CORRAR,  RFACTOR  and  SFACTOR  are 
C  created  by  the  program  PHDIST  and  contain  the 

C  distances  from  each  prototype  sound  unit  template 

C  to  each  section  of  speech;  along  with  scaling  and 
C  range  factors  used  in  determining  these  distances. 

C 

Cxxxxxx  There  are  two  output  files  generated  by  this 
C  program;  0UT1  and  0UT2.  The  first  contains  a 
C  tabular  listing  of  the  top  five  phoneme  choices 
C  for  each  time  period;  along  with  the  scale  factor; 

C  The  second  fiie  (0UT2)  is  identical  to  0UT1  except 
C  that  the  noise  occurring  before  and  after  the  actual 

C  speech  has  been  removed. 

C 

C 

c 

c 


INTEGER  SYMBOL (71 ) 

DIMENSION  ISAID(50) .ISPKR(IO)  .IDATEI3) 

DIMENSION  IBLK(512> ,IST0RE(200,71) 

DIMENSION  ICH1VAL(200),ICH2VAL(200),ICH3VAL(200) 

DIMENSION  ICH4VAU200)  ,  I CH3VAL ( 200 )  ,  ICH1PH0 (  200 )  ,  ICH2PH0  ( 200 ) 
DIMENSION  ICH3PH0(200; , ICH4PH0 ( 200) , ICH5PHG< 200 > 

COMMON  /  NAME  /  SYMBOL 
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A124  851  ISOLATED  WORD  RECOGNITION  USING  FU2ZV  SET  THEORV(U)  AIR 
FORCE  INST  OF  TECH  NRIGHT-PATTERSON  AFB  OH  SCHOOL  OF 
ENGINEERING  G  J  NONTGOHERV  DEC  82  AFIT/GE/EE/82D-74 

.F/G  5/8 


UNCLASSIFIED 


C  THIS  PROGRAM  TAKES  UP  TO 

C  UP  TO  75  PHONEMES . 

C 

C  i  >  .  <  «  .  i  .  . 

C  ##*#*###»**»#********#**##**#it*##**####*######**#**####*## 

c  *##*#*##*#*#****#*#********#***#»*##***«*####***#*#**###** 


c 

H 

xx 

c 

»s 

PROGRAM:  CHOICES 

xx 

c 

XX 

xx 

c 

XX 

WRITTEN  BY:  LT  KARL  SEELANDT 

xx 

c 

XX 

XX 

c 

XX 

DATE:  1  DEC  1981 

XX 

c 

o 

XX 

c 

xx 

REVISED  BY:  JERRY  MONTGOMERY 

xx 

c 

xx 

XX 

c 

xx 

DATE:  15  APRIL 

XX 

c  «*«**»»»*»***»#****##»»**#*»**#*»»###»*#***#»*####*«###*#* 

C  #*«*»#»##»*#**#**************»**»*#****»#**»«»#*****»**### 

c 

c 

c 

C******  NOTE:  THIS  ROUTINE  IS  FOR  FORTRAN  5  ONLY!!!! 

C 

c 

C******  This  program  is  used  mith  the  program  PHDIST 
C  to  locate  and  tabulate  useful  information  about 
C  sound  units  is  speech.  The  routine  is  currently 
C  set  up  to  handle  a  maximum  of  71  phonemes. 

C  The  files  CORRAR,  RFACTOR  and  SPACTOR  are 
C  created  by  the  program  PHDIST  and  contain  the 

C  distances  from  each  prototype  sound  unit  template 

C  to  each  section  of  speech;  along  mith  scaling  and 
C  range  factors  used  in  determining  these  distances. 

C 

C****#»  There  are  tmo  output  files  generated  by  this 
C  program;  0UT1  and  0UT2.  The  first  contains  a 
C  tabular  listing  of  the  top  five  phoneme  choices 
C  for  each  time  period;  along  mith  the  scale  factor; 

C  The  second  file  (0UT2)  is  identical  to  OUTi  except 
C  that  the  noise  occurring  before  and  after  the  actual 

C  speech  has  been  removed. 

C 

C 

C 

C 


INTEGER  SYMBOL (71) 

DIMENSION  ISA1D(50>,ISPKR(10),IDATE<3) 

DIMENSION  I BLK (512) , I  STORE (200,7 1 ) 

DIMENSION  ICH1VAK200)  ;ICH2VAL(200)  ,ICH3VAL(200) 

DIMENSION  ICH4VAU200)  ,ICH5VAL(200)  ,ICH1PH0(200) , I CH2PH0 ( 200 ) 
DIMENSION  ICH3PH0(200);ICH4Ph0(2O0) ;ICH5PH0(200> 

COMMON  /  NAME  /  SYMBOL 
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ouu  U  U  U  U  U  U  U  O  CJ  o  o  o  oo  o  ooo 


DATA  SYMBOL/" 01" ,”02” ,“03“ ,“04“ ,“05“,“06" ,"07“  , 

♦  “  0  8  " ,  “  0  9  “ ,  “  1 0  1 T  V 1 2  ‘  1 3  " ,  “  1 4  " ,  “  1 5  ” , 

+  “18“ ,“17“ ,“18“ ,“19“ ,“20“ ,“21"  ,“22"  ,“23“ , 

♦  “24“ ,“25“ ,"28“ ,“27“ » "28”  ,“29” ,“30“  ,"31“ , 

♦  “  3  2  " ,  “  3  3  “ , "  3  4  “ ,  “  3  5  “  ,  “  3  8  " ,  “  3  7  “  ,  “  3  8  “ , "  3  9 " , 

♦  “40“ ,“41“ ,"42“ ,“43” ,"44“ ,”45"  ,"48“  ,"47“  , 

+  “48“ ,“49“ ,“50“ ,”51 " , “52“ , "53“  ,“54“ , “55" , 

♦  “58“,"57",“58“,"59“,“80","81","82“,"83", 

4  “84" ,“85“ ,"88“ , “87“ , “88“ ,  “89“  ,“70“  ,“71“/ 


OPEN  FILES, PRINT  HEADING,  INPUT  CONTROL  VARIABLES. 
CALL  DFILW( “0UT1" ,IER21 ) 

IF(IER21.NE.1)TYPE“ERR0R  on  DFILN  IER21*  “,IER21 

OPEN  4,"0UT1“  recognition  results  . 

CALL  DFILHC0UT2"  ,IER21) 

IF ( IER21 .NE. 1 ) TYPE "ERROR  on  DFILH  IER21*  ”,IER21 

OPEN  12,“0UT2“  recognition  results  . 

CALL  0PEN(1 ,“C0RRAR",1 ,512,1ER1) 

IF( IER1 .NE. 1 )TYPE“ERR0R  1ER1  *“,IER1 
CALL  CHECK  (IER1) 

OPEN  3,“SFACT0R“  scaling  factor  frot  PHDIST 


CALL  FGTIMEdH0UR,IHIN,ISEC) 
CALL  DATE(IDATE,IER2) 

CALL  CHECX  (IER2) 


«««««* 


Obtain  and  then  write  out  to  OUT ( 1 ) 


about  the  speechfile  recognized. 


info 


ACCEPT"  ENTER  the  words  spoken  (  <  50  char  ): 
/  <15>" 

READdl  ,15)  ISAIDd) 

15  F0RHAT1S50) 

C 

ACCEPT"  ENTER  a  speaker  identifier  US  etc.) 

/  (  <  10  char  ):<15> 

READdl, 20)  ISFKRd) 

20  FORMAT  <  S 1 0 ) 

C 
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C***»«»  Print  a  heading  to  file  QUT1. 
C  contains  recognition  results  only!!! 
WRITE  (12,103)  ISAID(l) 

102  FORMAT  (20X, "SENTENCE  SPOKEN 

103  FORMAT  (S50) 


This  file 


,S50  n 


WRITE  (12,105)  ISPKR(l) 

104  FORMAT  (20X, “SPEAKER  WAS  s  ",S10  /> 

105  FORMAT  (S10) 

C 

C 

C 

C 

C 

C****#«  Noe  output  an  informative  heading  to  explain 
C  results  of  the  distance  measurement  routine. 

C 

WRITE! 12,207) I  DATE 

207  FORMAT! 15X, “THE  DATE  1S--“2I3,I5) 

WRITE (12,200) IHQUR,IMIN,ISEC 

208  F0RHAT(15X, “THE  TIME  IS--", 313  //> 

C 

Write  a  header  for  the  list's  of  output. 

C 


WRITE(12,720) 
WRITE(12,722) 
WRITE! 12,724) 


;send  the 
;header  out 
;now. 


F0RHAT(4X, “VECTOR", 3X, "FIRST", 2X, “SECOND", 3X, "THIRD", 
2X, “FOURTH", 3X, "FIFTH", ?X, "SCALE") 

F0RMAT(4X, "HUMBER", 2X, "CHOICE", 2X, "CHOICE" ,2X, "CHOICE 
2X, “CHOICE”, 2X, "CHOICE", 8X, "FACTOR") 


F0RMAT(4X, "*****»" ,2X  ,"**»*#*" ,2X, 
2X, "»»****", 2X, ”***»«*", 8X,"«»»*#« 


'*##**»" ,2X,' 
•  /) 


««*««* 


the  following  output  naKes  use  of 
information  contained  in  the  three(15)” 
files  listed  below.  This  output<15)" 
consists  of  the  five  'best'  choices<15)" 
of  a  matching  prototype  phonene<15)" 
for  each  time  frame  along  with  a<15)" 
a  set  of  statistics  on  the  phonene(15)" 
choices  for  each  frame. 

CORRAR  -  Contains  spectrographic<15)" 
distances  between  prototype  phonemes<15)" 
and  the  speechfile.  These  are  the(15)“ 


C  results  of  the  D 1ST  routine,  end  <15>” 

C  consist  of  raw  data  only.  (15X15)“ 

C  SF ACTOR  *  Contains  the  scale  factordb)" 

C  used  to  relate  two  identical  scoresd5)“ 

C  (say  100)  in  different  tine  franes.d5)” 

C  And  the  lower  the  value  of  this<15>" 

C  nueber  the  better  the  phoneae  natch, <1S>" 

TYPE  "  ” 

TYPE  "-Sentence  length  =  length  in  vectors*" 

TYPE  "Reaeaber  that  there  are  4  vectors/block  so  “ 

TYPE  “this  prograa  can  recognize  up  to  100  blocks" 

TYPE  “of  speech  at  one  tiae 

TYPE  “ . <15)“ 

C 

ACCEPT  “STARTING  BLOCK*" ,1START 
C 
C 
C 
C 

ACCEPT  "LENGTH  OF  SENTENCE  (in  vectors)  *", IVECT 
IVECT  *  IVECT  -  1  tad  just  so  don't  overread 

NUHPP*71 
IEND*0 
1BEC1N*0 
1FLAC*0 
IFLAG1*0 
IC0NT*1 
1SET*0 

1«!ENT1.*TUECT 

IF  (ISENTL.LE.200)  GO  TO  667 
ISENTL*200 
IC0NT*2 

666  DO  705  IC*1,I£0NT 

IF  (IC.EQ.ll  GO  TO  M7 
REMIND  1 
REMIND  3 
ISENTL*IVECT-200 
ISTART*4?+ISTART 
ISET*19fl 
M7  CONTINUE 

ISTBLX  *  0  ;NUNBER  OF  BLOCK  TO  READ  FIRST 
C 

Caeeeta  Calculate  nuaber  of  first  tine  increaent  used 
C  in  order  to  get  the  correct  value  out  of  RFACT0R. 

C 

ISAHP* ( (256.0/64 )  *  ISTART)  ♦  1  ;for  64  pt.  DFT 

C 
C 

The  following  loops  read  in  the  results  of  the 
C  recognition  routine  (PHDIST)  ,aaking  sure  that  the 
C  correct  values  of  neasureaent  distances  are  read 
C  by  setting  up  an  offset  as  shown. 

C 

Ceeeeee  Note:  The  way  in  which  the  results  are  stored 
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c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

c 


C 


in  CORRAR  is  as  follows.  Each  tno  block  grouping 
of  data  contains  the  distances  between  one  phonene 
(for  blocks  Oil,  the  phonene  is  the  1st  phonene; 
and  for  say,  blocks  445,  the  phonene  is  the  3rd 
phonene)  and  each  vector  of  the  sentence* 

So  for  a  speechfile  that  contains  340  vectors 
(  =  90  Blocks  of  sanpled  data)  of  infornation 
the  first  340  positions  of  each  two  block  set 
■entioned  above  contains  the  first  thru  the 
340th  distance  Measure  value  between  the  phonene 
specified  by  the  tno  block  grouping  and  each 
vector  of  the  speech  file* 


DO  500  1H= 1 ,NUHPP 
CALL  RDBLK( 1 ,ISTBLK,IBLK,2,IER3> 

IF(IER3.NE.l)  TYPE  “ERROR  ON  RDBLK,IER3:"  ,IER3 
IF(1ER3*EQ*9)  TYPE  “Check  the  nuaber  of  phoneaes  used" 
CALL  CHECK  (IER3) 


C 


;Now  read  in  distance  Measure  results* 


IOFFSET  =  I  SAMP  -1 

DO  300  1=1/ 1 SENTL 
INVAL  =  I  +  IOFFSET 
300  ISTOREd ,  I M ) 

ISTBLK  *  ISTBLK  ♦  2 
C 

500  CONTINUE 

C 
C 


;offset  used  to 
;get  correct  Measure 
;for  each  vector. 

;set  up  offset 

IBLK(INVAL)  ;qet  result* 

;do  next  phonene. 

;  END  of  input  loop*. ...... 


C****t«  For  each  tine  frane  (vector)  the  following  portion 
of  this  progran  will: 


1.  Scale  the  results  of  BIST  routine 

so  each  vector  has  a  0  to  100  relative 
score  on  the  best  choice  of  a  phonene 
for  recognition  purposes* 

2.  Deternine  the  “best"  five  prototype 
phonene  choices,  and  save  the  relative 
score  along  with  chosen  phonene* 

3*  Print  out  the  "best"  five  choices 

Mentioned  in  2,  along  nith  statistics 
of  the  accuracy  of  the  results. 


1 CHI VAL C J )  *  Value  (0  to  100)  of  the  first  choice 
phonene  for  a  given  tine  frane* 

ICH3PH0(J)  *  ID'FIER  of  the  phonene  which  is  the  third 


non  or>  ooor* 


c 

C««***e  Initialize  the  array's  for  the  default  choice. 

C 

DO  550  X-l jISENTL 

ICH1VAL(K)=0  ; 

ICH2VAKK)  =0  ? 

ICH3VAL ( K) =0  ; 

ICH4VAHK)  =  0  ; 

ICH5VAL(K)=0  \ 

C 

ICHIF'HO(K)  =  SYHB0L(71  >  ; 

ICH2PH0(K)=SYHB0L<71>  ; 

ICH3PH0(X)=SYHB0U71)  ; 

ICH4PH0(K) =SYHB0L<71)  * 

ICH5PH0(K)  =  SYHB0L(71 )  ; 

C 

550  CONTINUE 

C 
C 

C******  Determine  the  five  ''best'1  choices  for 

C  each  tine  fraae. 

C 

ISENT  =  ISENTL  -  2 

DO  705  J=1;1SENT  ;for  each  phoneme 

DO  700  1=1 ;NUHPP  ;and  each  vector. 

IST0RE<J,I)  =  IFI X ( FLOAT ( I8T0RE(J ) ) 732767  *  100) 


IFUSTOREI  J;I) .GT . ICHIVAL(J) )  GO  TO  640 
IF ( I ST ORE (J ,1) .GT.1CH2VAK J))  GO  TO  635 
I F ( I  STORE ( J , I ). GT. ICH3VALI J ) )  GO  TO  630 
IF < ISTORE < J ;I).GT.ICH4VALU))  GO  TO  625 
IF(ISTORE(J;I) .  GT.ICH5VAU  J) )  GO  TO  620 


700  CONTINUE 


C#**#»*  NOTE:  If  start  processing  in  the  middle  of 
C  a  file;  be  sure  that  you  start  reading  the  correct 
C  SFACTOR  values. 

C 

IF(J.GT.l)  GO  TO  710  ;only  do  it  once 

IF(ISTART.EQ.O)  GO  TO  710  ;skip  this  part 

C 

ISAHP2  *  ISAHP  -  1 
C 

DO  70?  KOI  =  l,ISAr.P2 

c 


P! 

■ -r  - 

m 

i 

• 

■ 

/■vr  ■ 
v V* 

V.' 

READ 

C 

(3,603)  RDUHPSF  ;a)ign  SF  file. 

709 

CONTINUE 

i 

C 

710 

CONTINUE 

/  N 

C 

READ 

(3,803)  SFACT0R  ;SFACT0R  (J) 

C 

i 

803 

C 

FORMAT ( FI 1 .8)  jfactor  1/0  fornat  st»t. 

C 

C 

-  •* 

c***»*« 

If  start  in  the  middle  of  a  file  adjust  the 

-v 

C  vector  value  printed... 

1 

C 

JVECTOR  *  J  ♦  ISAMP  -  1 

C 

C 

r‘._ 

C 

rr 

C 

(a 

c«****« 

C  tike 

Non  send  out  a  listing  of  inforeation  for  this 
f  raue. 

C 

^  ■ 

C 

,*  ’ 

IF  ( ( I CH 1 PH0 ( J ) ,NE. “01" ) . AND. ( ICH1PH0< J) ,NE« "71“) ) 

IFLAG*1 

'V 

IF  (IFLAC.EQ.0)  GO  TO  660 

n 

IF  (IFLAG1.EQ.1)  GO  TO  661 

IFLAG 1 « 1 

5; " 

IBEGIN= J+ISET 

661 

CONTINUE 

IF  ( (ICH1PH0( J) .NE/'OT') .AND, (ICHIPHO(J) .NE."71“)) 

IEND--J+ISET 

660 

CONTINUE 

C 

C 

WRITE (4^761)  JVECTOR, 

+ 

(ICHIPHO(J) ) ,ICH1VAL( J) ,  ( I CH2PH0 ( J ) > , ICH2VAK J)  , 

♦ 

( ICH3PH0 ( J ) ) , I CH3VAL ( J ) ,  (ICH4PH0(J) )  ,ICH4VAl(J) , 

o 
'  < « 

+ 

(ICH5PH0( J ) ) , ICH5VAL <  J ) ,SFACT0R 

•€ 

C 

'rf 

c 

v 

c 

c 

> 

c 

c 

761 

FORMAT (5X ,14 ,3X ,A2 ,”  " ,I3,2X,A2,”  ",I3,2X,A2," 

V3, 

/ 

2 X , A 2 , *'  ”,I3,2X,A2,"  “,I3,3X,f  11.8) 

c 

c 

c 

705 

r 

CONTINUE 

k  ( 

u 

60  TO  1000  sstop, 

•.V 

C 

C 
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C«***«»  The  following  subsets  set  the  correct  values 
C  of  1CH-VAL  and  ICH-PHO  depending  on  the  results  of 

C  the  search  for  "best"  five  phoneme  choices  at  each 

C  time  frame. 

C 

620  ICH5VAUJ)  =  I  STORE  ( J  ,1 )  {new  score. 

ICH5PH0(J)  *  SYMBOL C I )  {new  phoneme  « 

CO  TO  700  {continue 

C 
C 

625  ICH5VAM  J)  =  ICH4VAUJ)  ;shi 


I CH5PH0  <  J ) 
ICH40AM  J  > 
ICH4PH0( J) 
CO  TO  700 


1CH4PH0( J) 

I STORE ( J  ^  Z ) 
SYCIBOLU) 


{shift  down 
{scores  and  #'s. 
{new  score. 

{new  m. 

{continue. 


630  ICH5VAU  J)  =  ICH4VAK  J) 

ICH5PH0(J)  =  ICH4PH0( J) 

ICH4V AL ( J )  *  ICH3VAK  J) 

ICH4PH0 ( J )  ;  ICH3PH0 ( J ) 

ICH3VAK  J  )  *  I  STORE  ( J  ,1 ) 

I CH3PH0 ( J )  =  SYHBQKI) 

CO  TO  700 


{shift  down 
{scores  and  »'s. 
{shift 
{again. 

{new  score. 

{new  «. 

{continue 


ICH5VAUJ)  *  I CH4  VAL  ( J  > 


ICH5PH0  <  J  > 
ICH4VAL  <  J ) 
ICH4PH0( J) 
ICH3VAL ( J ) 
ICH3PH0  <  J ) 
ICH2VAKJ) 
ICH2PH0( J) 
CO  TO  700 


ICH4PH0 ( J ) 
ICH3VAK  J) 
ICH3PH0 ( J ) 
ICH2VAKJ) 

I CH2PH0 ( J  > 

I  STORE ( J  > I ) 
SYHBOL(I) 


{shift 


shift 

shift 

new  score, 
new  w. 
continue. 


ICH5VAU J)  -  ICH4VAK J) 


{shift 


ICH5PH0(J) 

ICH4PH0 ( J > 

% 

1CH4VAUJ) 

1CH3VAUJ) 

{shift 

ICH4PHQ(J> 

1CH3PH0(J) 

% 

* 

ICH3VAK  J ) 

ICH2VAK  J) 

{Shift 

ICH3PH0( J) 

ICH2PH0(J) 

• 

* 

ICH2VAKJ) 

1CH1VAH  J ) 

{shift 

I CH2PH0  <  J ) 

ICHIPHO(J) 

% 

ICH1VAUJ) 

I  STORE  <  J  ^ 1 ) 

{new  score. 

ICH1PH0 ( J , 

SYMBOL ( 1 ) 

{new  «. 

GO  TO  700 

{continue. 

C 

C 

c 

C*#***«  How  send  the  recognition  results  (best  choice 
C  for  each  vector)  to  the  file  0UT2.  Be  sure  to 
C  send  correct  vector  choice  to  the  right  file 
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u  u  u  o 


position  if  adding  to  a  file 


1000  J*1 

CLOSE  1 
CLOSE  3 
REWIND  4 

DO  1121  1 CNT  = 1 ,  IBEGIN 
READ (4/1130)  JVECTOR, 

♦  <ICH1PH0(J)),ICH1VAL<J),  <ICH2PH0< J) )  ,ICH2VAL( J) , 

+  < I CH3PH0 ( J > )  ,ICH3VAL(J)  ,  < ICH4PH04 J ) >  ,ICH4VAUJ> , 

♦  <ICH5PH0(J)),ICH5VAL(J),SFACTl)fi 

1121  CONTINUE 
1CAP=IEND-IBEGIN+1 

IF  (IGAP.LE.O)  GO  TO  1122 
DO  1122  ICNT  = 1 ,IGAP 
WRITE( 12 ,1 130)  JVECTOR, 

♦  (ICHIPHO(J)) ,ICH1VAL( J) ,  (ICH2PH0( J) )  ,ICH2VAL< J) , 

♦  ( 1CH3PH0 ( J) ) , ICH3VAL ( J  ) ,  ( ICH4PH0 ( J ) ) , ICH4VAK J )  , 

♦  <ICH5PH0(J)),ICH5VAL(J),SFACT0R 
READ(4 ,1130)  JVECTOR, 

+  (ICHIPHO(J) ),ICH1VAL(J),  (ICH2PH0( J) ) ,ICH2VAL( J) , 

+  ( I CH3PH 0 ( J ) ) , ICH3VALI J) ,  ( I CH4PH0 <  J ) ) , ICH4VAU J) , 

♦  t 1CH5PH0 ( J ) > ,ICH5VAL( J) ,SFACTOR 

1122  CONTINUE 

1130  F0RHAT(5X,I4,3X,A2, . ,I3,2X,A2,"  ",I3,2X,A2," 

/  2X,A2 " ,13,2X ,A2,“  ",I3,3X,F11.8> 

C 

1131  F0RnAT(14 ,A2,I3,A2,I3,A2,13,A2,I3,A2,I3,F11.8) 

C 

C 

1140  CALL  RESET 

C 

TYPE“<15)<15><15>“ 

TYPE” - RECOGNITION  RESULTS  ALONE  IN  FILE  '0UT1' - ” 

TYPE" - RESULTS  WITH  NOISE  REPIOVED  IN  FILE  '0UT2'- 

TYPE“< 15)< 15)  " 

STOP---- 

END 
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